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Chapter 1 



Introduction 



We present here various results, which may one day be published in a bigger paper, and which we 
wish to make already available to the Community. 

We investigate several technical and conceptual questions. 

Our main subject is the investigation of indcpcndcnce as a ternary relation in the context of non- 
monotonic logic. In the context of probability, this investigation was started by W. Spohn et al., 
and then followed by J. Pearl. We look at products of function sets, and thus continue our own 
investigation of independence in non-monotonic logic. We show that a finite characterization of 
this relation in our context is impossible, and indicate how to construct all valid rules. 
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Chapter 2 

Countably many disjoint sets 



We show here that - independent of the cardinality of the language - one can define only countably 
many inconsistent formulas. 

The question is due to D. Makinson (personal communication). 

We show here that, independent of the cardinality of the language, one can define only countably 
many inconsistent formulas. 

The problem is due to D. Makinson (personal communication). 
Example 2.0.1 

Thcre is a countably infinite set of formulas s.t. the defined model sets are pairwise disjoint. 
Let pi : i 6 lj be propositional variables. 
Consider := /\{^Pj : j < i} A pi for i E lü. 
Obviously, M (</>;) ^ for all i. 

Let i < i'; we show M(&) n M(&,) = 0. M(4> 1 ,) |= -y u M(^) |= Pi . 
□ 



Fact 2.0.1 

Any set X of consistent formulas with pairwise disjoint model sets is at most countable 
Proof 

Let such X be given. 

(1) We may assume that X consists of conjunctions of propositional variables or their negations. 

Proof: Rewrite all cf) e X as disjunctions of conjunctions <j>j. At least one of the conjunctions <pj is 
consistent. Replace <j> by one such <f>j. Consistency is preserved, as is pairwise disjointness. 

9 
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(2) Let X be such a set of formulas. Lct Xi C X be the set of formulas in X with length i, i.e., a 
consistent conjunetion of i many propositional variables or their negations, i > 0. 

As the model sets for X are pairwise disjoint, the modcl sets for all <ß G Xi have to be disjoint. 

(3) It sufhees now to show that each Xi is at most countable; we even show that each Xi is finite. 
Proof by induetion: 

Consider i — 1. Lct <p, <p' € X\. Let be p or ->p. If ^' is not -«f>, then and <ß' have a common 
modcl. So one must be p, the other -ip. But these are all possibilities, so card(Xi) is finite. 

Let the result be shown for k < i. 

Consider now Xi. Take arbitrary <f> £ X^. Without loss of generality, let 4> = Pi A • • • A Pi- Take 
arbitrary <fi' ^ <f>. As M((p) flM(0') = 0, 0' must be a conjunetion containing one of ~^pk, 1 < k < i. 
Consider now Xi : k ■= {</>' E Xi : <j>' contains ~^pk}- Thus Xi = {$} U {J{Xi y k ■ 1 < k < i}. Note 
that all iß, iß' £ Xi t k agree on ~^pk, so the Situation in X^k is isomorphic to Xi-\. So, by induetion 
hypothesis, card(Xi t k) is finite, as all <f>' £ have to be mutually inconsistent. Thus, card(Xi) 
is finite. (Note that we did not use the fact that elements from different Xi.k, X it k> also have to 
be mutually inconsistent; our rough proof suffices.) 

□ 



Note that the proof depends very littlc on logic. We needed normal forms, and used two truth 
values. Obviously, we can easily generalize to finitely many truth values. 



Chapter 3 

Independence as ternary relation 



3.1 Introduction 



3.1.1 Independence 

Independence is a central concept of reasoning. 

In the context of non-monotonic logic and related areas like theory revision, it was perhaps first 
investigated formally by R. Parikh and co-authors, see e.g. |Par96j . to obtain "local" conflict 
Solution. 

The present authors investigated its role for interpolation in preferential logics in |GS10) . and 
showed connections to abstract multiplication of size. 

Independence plays also a central role for a FOL treatment of preferential logics, where problems 
like the "dark haired Swedes" have to be treated. This is still subject of ongoing research. 

J. Pearl investigated independence in graphs and pobabilistic reasoning, e.g. in }Pea88j . also as a 
ternary relation, (X \ Y \ Z). 

The aim of the present paper is to extend this abstract approach to the preferential Situation. We 
should emphasize that this is only an abstract description of the independence relation, and thus 
not the same as independence for non-monotonic interpolation as examined in [GSlOj . where we 
used independence, essentially in the form of the multiplicative law fj,(X X Y) = ß(X) x ß(Y), 
which says that the \x— function preserves independence. 

We have not investigated if an interesting form of interpolation results from some application of ß 
to situations described by (X \ Y \ Z), analogously to above application of /x to situations described 
by(X\\Y). 

11 
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3.1.2 Overview 
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We will first discuss simple examples, to introduce the main ideas. 

We then present the basic definitions formally, for probabilistic and set independence. 

We then show basic results for set independence as a ternary relation, and turn to our main results, 
absence of finite characterization, and construetion of new rules for this ternary relation. 

3.1.3 Discussion of some simple examples 

We consider here X = Y = Z = W = {0, 1} and their produets. We will later generalize, but the 
main ideas stay the same. First, we look atlxZ (the Cartesian produet of X with Z), then at 
X x Z x W, at X x Y x Z, finally at X x Y x Z x W. Elements of these produets, i.e., sequences, 
will be written for simplicity 00, 01, 10, etc., context will disambiguate. General sequences will 
often be written er, r, etc. We will also look at subsets of these produets, like {00, 11} C X x Z, 
and various probability measures on these produets. 

As a matter of fact, the main part of this article concerns subsets A of produets X\ x . . . x X n and 
a suitable notion of independence for A, roughly, if we can write A as A\ x . . . x A m . This will be 
made more precise and discussed in progressively more complicated cases in this section. 

In the context of preferential struetures, A is intended to be fJb{X\ x . . . x X n ), the set of minimal 
models of X\ x . . . x X n . 

3.1.3.1 X x Z 

Let P : X x Z — > [0, 1] be a (fixed) probability measure. 
If A C X x Z, we will set P(A) := £{P(cr) : a G A). 

If A x := {er e X x Z : <r(X) = x}, we will write P{x) for P(A X ), likewise P(z) for P{A Z ), 
if A z := {er 6 X x Z : a(Z) = z}. When these are ambiguous, we will e.g. write Ax=o for 
{er e X x Z : a(X) = 0}, and P(X = 0) for P(A x =o), etc. 

We say that X and Z are independent for this P iff for all xz E X x Z P(xz) = P{x) * P(z). 
We write then (X \ \ Z)p, and call this and its variants probabilistic independence. 

Example 3.1.1 

(1) 

P(00) = P(01) = 1/6, P(10) = P(ll) = 1/3. 

Then P(X = 0) = 1/6 + 1/6 = 1/3, and P(X 
P(Z = 1) = 1/2, so (X\\Z) P . 

(2) 

P(00) = P(ll) = 1/3, P(01) = P(10) = 1/6. 

Then P(X = 0) = P(X = 1) = P(Z = 0) = P(Z 
so ->(X || Z) P . 



= 1) = 2/3, P(Z = 0) = 1/6 + 1/3 = 1/2, and 



= 1) = 1/2, but P(00) = 1/3^1/2*1/2 = 1/4, 



Definition 3.1.1 



Heiteres 

Consider now ^ A C X x Z for general X, Z. 
Definc the following probability measure on X x Z 
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Pa(ct) := 




Example 3.1.2 

(1) 

A := {00,01}, 

then P A (00) = P A (0l) = 1/2, P A (10) = P A (ll) = 0, P A (X = 0) = 1, Pa(X = 1) = 0, P A (Z = 
0) = Pa(Z = 1) = 1/2, and we have (X \\ Z) Pa . 

(2) 

4:= {00, 11}, 

then Pa(00) - Pa(H) = 1/2, P A (0l) = P A (10) = 0, P A (X = 0) = P A (X = 1) = 1/2, P A (Z = 
0) = P A (Z = 1) = 1/2, but Pa(00) = 1/2 i= P A (X = 0) * P A (Z = 0) = 1/4, and we have 
-<*I|Z>p a . 

(3) 

A := {00,01,11}, 

then P A (00) = P A (01) = P A {ll) = 1/3, P4(10) = 0, P A (X = 0) = 2/3, P A (A = 1) = 1/3, 
P A (Z = 0) = 1/3, P A (Z = 1) = 2/3, but Pa(00) = 1/3 ^ P A (A = 0) * P A (Z = 0) = 2/3 * 1/3 = 
2/9, and we have -■(X || Z)p A . 

Note that in (1) above, A = {0} x {0, 1}, but neither in (2), nor in (3), A can be written as such 
a produet. This is no coincidcnce, as we will see now. 

More formally, we write (X || Z) A iff for all ctt G A there is p G vi such that p(A) = cr(X) and 
p{Z) = t(Z), or, equivalently, that A = {ct(A) : a e i} x ■ o G A}, mcaning that we can 

combinc fragments of funetions in A arbitrarily. 

We call this and its variants set independence. 
Fact 3.1.1 

Consider above Situation X x Z. Then (X || Z) Pa iff (X || Z) A . 



A C {cr(A) : er G ^4} x {<j(Z) : a G A} is trivial. Suppose Pa(2;, z) = PaM * PaC^), but there are 
er, t G A, <j{X)t(Z) A Then P A (x),P A (z) > 0, but P A (x, z) = 0, a contradiction. 



Proof 



Case 1: Pa(2;) = 0, then P A (x, z 



) = 0, and we are done. Likewise for P A (Z) = 0. 
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Case 2: P A (x),P A (z) > 0. 
By definition and prerequisite, 

' aeA:a(X)=x} _ c 

card(A) card(A) 



P A {x) card{a^iA:a(X)— x} card{(r(Z):<T£A} 



p / \ card{a^A:(r(Z)—z} card{a(X) :<r£ Ä] 

^A(Z) — card(A) ~ card(A) ' 

p / \ card{a^A:a(X)—x,a(Z)—z} _ 1 

^A{X, Z) — card(A) — card{A) ' 

By prerequisite again, card(A) = card{a(X) : a G A} = card{a(Z) : a G A}, so card l°$ A ° eA} * 

card{a(X):c7eA} _ 1 

card(A) ~ card(A) 

□ 



3.1.3.2 X x Z xW 

Herc, W will not be mentioned directly. 

Let P : X x Z x W — > [0, 1] be a probability measure. 

Again, we say that X and Z arc independent for P, (X || Z)p, iff for all i e 1, z e Z P(x, z) = 
P(x)*P{z). 

Example 3.1.3 

(1) 

Let P(000) = P(001) = P(010) = P(011) = 1/12, P(100) = P(101) = P(110) = P(lll) = 1/6, 
then X and Z are independent. 

(2) 

Let P(100) = P(101) = P(010) = P(011) = 1/12, P(000) = P(001) = P(110) = P(lll) = 1/6, 
then P(X = 0) = P(X = 1) = P(Z = 0) = P(Z = 1) = 1/2, but P(X = 0,Z = 0) = 1/3 ^ 
1/2*1/2 = 1/4, so -,{X || Z) P . 

As above, we define P A for ^ A C X x Z x W. 

Example 3.1.4 

(1) 

A := {000,001,010,011}. Then P A (X = 0,Z = 0) = P A (X = 0,Z = 1) = 1/2, P A (X = 1,Z = 
0) = P A (X = 1, Z = 1) = 0, Pa(^ = 0) = 1, P4(^ = 1) = 0, P^(Z = 0) = P A (Z = 1) = 1/2, so 
X and Z are independent. 

(2) 

For A := {000,001, 110, 111}, we see that X and Z are not independent for P A . 

Considering possible decompositions of A into set produets, we are not so much interested how 
many continuations into W we have, but if there are any or none. This is often the case in logic, 
we are not interested how many models there are, but if there is a model at all. 
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Thus we define independence for A again by: 

(X || Z) A iff for all <jt E A there is p e A such that = a(X) and p(Z) = t(Z). 

The equivalence between probabilitistic independence, (X \ \ Z)p A and set independence, (X \ \ Z)a 
is lost now, as the second part of the following example shows: 

Example 3.1.5 

(1) 

A := {000,010,100,110} satisfies both forms of independence, (X \\ Z)p A and set independence, 
{X\\Z) A . 

(2) 

A := {000,001,010,100,110}. 

Here, we have P A {X = 0) = 3/5, P A (X = 1) = 2/5, P A (Z = 0) = 3/5, P A (Z = 1) = 2/5, but 
Pa(A = 0, Z = 0) = 2/5 ^ 3/5 * 3/5. 

Consider now (X \\ Z) A : Take <j,t E A, then for all possible values cr(X), t(Z), there is p such 
that p(X) — cr(X), p(Z) = t(Z) - the value p{W) is without importance. 

We have, however: 
Fact 3.1.2 

(X\\Z) Pa => (X\\Z) A . 
Proof 

Let er, t e A, but suppose there is no p e A such that = ct(X) and p{Z) = t(Z). Then 

P A (a(X)),P A {r(Z)) > 0, but P A ( ff (X), t(Z)) = 0. □ 



3.1.3.3 IxYxZ 

We consider now independence of X and Z, given Y". 
The probabilistic definition is: 

(X | Y | Z) P iff for all x £ X,y E Y,z E Z P(x,y,z) * P(y) = P(x,y) *P(y,z). 

As we are interested mainly in subsets A E X x Y x Z and the resulting P A , and combination of 
funetion fragments, we work immediately with these. 

We have to define (X \ Y \ Z) A . 

(X | Y | Z) A iff for all er, r £ A such that = r(F) there is p G A such that p(X) = a(X), 

p(Y) = a(Y) = t{Y), p(Z) = t(Z). 

When we set for y E Y A y := {a E A : a(Y) = y}, we then have: 
A y = {a(X) : er E A y } x {y} x {a(Z) : a E A y }. 

The following example shows that (X \ Y \ Z) A and (X \ \ Z) A are independent from each other: 
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Example 3.1.6 

(1) 

(X | Y | Z) A may hold, but not {X \ \Z) A : 

Consider A := {000,111}. (X \ Y \ Z) a is obvious, as only a goes through each element in the 
middle. But there is no 0x1, so (X | Z)a fails. 

(2) 

(X || Z) A may hold, but not (X \ Y \ Z) A : 

Consider A := {000, 101, 110, 011}. Fixing, e.g., in the middle shows that (X \ Y \ Z) A fails, but 
neglecting the middle, we can combine arbitrarily, so (X || Z)a holds. 

Example 3.1.7 

This example show that (X \ Y \ Z)a does not mean that A is some produet Ax x Ay x Az ■ 
Let A := {000, 111}, then clearly (X \ Y | Z) A , but A is no such produet. 

We have again: 

Fact 3.1.3 

Let|^CIxFx2, then {X \ Y \ Z)a a.nd (X | Y \ Z) Pa arc cquivalent. 



Proof 

Suppose there arc er, t G A such that cr(y) = t(Y), but there is no p G A such that p(X) = a(X), 
p(Y) = a(Y) = t(Y), p(Z) = t{Z). Then P A (a(X),a(Y)), P a (t(Y),t(Z)), P A (a(Y)) > 0, but 
P A (a(X),a(Y)=T(Y),T(Z))=0. 

Case 1: PaI^, 2/) or Pa{v, z) — 0, then Pa{x 1 y, z) — 0, and we are done. 

Case 2: P A (x,y), P A (y, z) > 0. By definition and prerequisite, P A {x,y) = card{ ' TgA ' e ^ ( ) ^ ) a: '' T(F)=i/ ^ 

. _ card{a(Z):aeA,a(Y)=y} , p , x _ card{aeA:a(Y)=y ,a(Z)=Z} _ card{a(X):aeA,a(Y)=y} 
~ card(A) AUU r AW,Z) — card(A) ~ card(A) ' hU 

_ card{aeA:c,(Y)=y} ^/r nramrar . p/„,\ _ cg£rfjo^4 i g(y}=j/} p / x _ 1 



ard{treA:tT(Y)=y) 
card(A)*card(A) 



so P A {y)*P A {x,y 1 z) = c Z d r { dT A tZ^}(M = PA{x iy )*P A {y,z) 



□ 



3.1.3.4 X x Y x Z x W 

The defmitions stay the same as for X x Y x Z. 

The equivalence between probabilitistic independence, (X | Y | Z)p A and set independence, (X 
Y | Z)a is lost again, as the following example shows: 



Heiteres 



17 



Example 3.1.8 

A := {0000, 0001, 0010, 1000, 1010}. 

Here, we have P A (X = 0,Y = 0) = 3/5, P A (X = l,Y = 0) = 2/5, P A (Y = 0,Z = 0) = 3/5, 
P A (Y = 0,Z = 1) = 2/5, P A (Y = 0) = 1, but P A (X = 0, K = 0, Z = 0) = 2/5 ^ 3/5 * 3/5. 

Consider now (X | Y \ Z) A : Take er, r G A, such that a(Y) = t(Y), then for all possible values 
a(X), t(Z), there is p such that p(X) = a(X), p{Y) = a{Y) = t{Y), p(Z) = t(Z) - the value 
p(W) is without importance. 

We have, however: 

Fact 3.1.4 

(x | y | z) Pa | y | 

Proof 

Let <j,t e A such that cr(F) = t(Y), but suppose there is no p e A such that p(X) = 
a{X), p(Y) = a(Y) = r(Y), p(Z) = r(Z). Then P A (a(X),a(Y)),P A (a(Y),T(Z)) > 0, but 
P A (a(X),a(Y),r(Z))=0. □ 



3.1.3.5 A remark on generalization 

The X, Y, Z, W may also be more complicated sets, themselves produets, but this will not change 
definitions and results beyond notation. 

In the more complicated cases, we will often denote subsets by more complicated letters than A, 
e.g., by S. 

3.1.3.6 A remark on intuition 

Consider set independence, where A := p(U), U = U\ x . . . x U n . Set (. . .) := (. . -)ß(u)- 

(1) (X || Z) means then: 

(1.1) all wc know is that we are in a normal Situation, 

(1.2) if we know in addition something definite about Z (1 model!) we do not know anything 
more about X, and vice versa. 

(X \ Y \Z) means then: 

(1.1) all we know is that we are in a normal Situation, 

(1.2) if we have definite Information about Y, we may know more about X. But knowing 
something in addition about Z will not give us not more Information about X, and 
conversely. 
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(2) The restriction to ß(U) codes our background knowledge. 

(3) Note that X U Y U Z necd not be /, e.g., W might be missing. We did not count the 
contimiations into W, but considered only existence of a continuation (if this does not exist, 
then there just is no such sequence). 

This corrsponds to multiplication with 1, the unit ALL on W, or, more generally, in the rest 
of the paper, with Ii-(xuyuz)- We may choose however we want, it has to be somewhere, 
in ALL. 

3.1.4 Basic definitions 
Definition 3.1.2 

If / is a function, Y a subset of its domain, we write / \ Y for the restriction of / to elements of 
Y. 

If F is a set of functions over Y, then F \ Y := {/ \ Y : / G F}. 

3.2 Probabilistic and set independence 
3.2.1 Probabilistic independence 

Independence as an abstract ternary relation for probability and other situations has been examined 
by W. Spohn, see |Spo80| , A. P. Dawid, see |Daw79] . J. Pearl, see, e.g., |Pea88j . etc. 

Definition 3.2.1 

(1) 

Let I 7^ be an arbitrary (index) set, for i E / Ui ^ arbitrary sets. Let U := H{Ui : i £ J}, and 
for X C / U x := U{U, : i £ X}. 

(2) 

Let P : V(U) — > [0, 1] be a probability measure. (We may assume that P is defined by its value 
on singletons.) 

(3.1) 

By abuse of language, for X C /, x E Ux, let P(x) := P({u EU : Vi G Xu{i) = x(i)}), so 
P(x) = P({u EU :u \ X = x}). 

Analogously, for X, Y C /, X n Y = 0, x E U x , V E Uy , let P(x, y) := P({u EU : u \ X = x and 
u \Y = y}). 

(3.2) 

Finally, for X,Y,Z C / pairwise disjoint, x E Ux, y E Uy, z E Uz, let P(x \ y) ;= P jf^ , 
P ( x I V' z ) : = P p(y,z) > etc - 

(We have, of course, to pay attention that we do not divide by 0.) 



Definition 3.2.2 
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P as above defines a 3-place relation of independence on pairwise disjoint X, Y, Z C I (X \ Y \ Z}j 
by 



f VxeU Xl VyeUY,Vz&Uz(P{y,z)>O^P{x\y) = P(x\y,z)), if Y^ 
i.e., P(x, y)/P(y) = P(x, y, z)/P(y, z),or 
P{x, y, z) * P(y) = P(x, y) * P{y, z) 

(X | y | Z) P < 

Vx G U Xl Vz e U z (P(z) > -> P(x) = P(x | z)), if Y = 

i.e.,P(x) = P(x,z)/P(z),or 
{ P(x,z) = P(x)*P(z) 

If Y = 0, we shall also write (X || Z) P for (X | Y \ Z) P . 

Recall from Section r3.1.3l (page fT^j) that we call this notion probabilistic independence. 

E.g., Pearl discusses the rules (a) — (e) of Definition 13.2.31 (page [T9|) for the relation defined in 
Definition ET2~2l (page HHJ) . 

Definition 3.2.3 

(a) Symmetry: (X \ Y \ Z) (Z | Y \ X) 

(b) Dccomposition: (X \ Y \ Z U W) -> (X \ Y \ Z) 

(c) Weak Union: {X \ Y \ Z U W) -> (X | F U W \ Z) 

(d) Contraction: (X \ Y \ Z) and (X \ Y U Z \ W) -> (X | F | Z U VF) 

(e) Intersection: (X | F U VF | Z> and {X \ Y U Z \ W) (X \ Y \ Z U W) 
(0) Empty outside: (X | Y | Z) if X = or Z = 0. 

Proposition 3.2.1 

If P is a probability measure, and (X \ Y \ Z)p defined as above, then (a) — (d) of Definition 13.2.31 
(pageHH) hold for (...) = (.. .)p, and if P is strictly positive, (e) will also hold. 

The proof is elementary, well known, and will not be repeated here. 
Doch ein Beispiel geben? 



3.2.1.1 A side remark on preferential struetures 

Boing a minimal element is not upward absolute in general preferential struetures, but in raked 
struetures, provided the smaller set contains some element minimal in the bigger set. 

Fact 3.2.2 

In the probabilistic Interpretation, the following holds: 
Let U be a flnite set, / : U -4 3? such that Vu G U.f(u) > 0. 
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For all AQU, such that 3a' G A.f(a') > and all a G A 
f A (a) '■— s{/(a') a i'eA} defines a probability measure on A. 

For BCi, dehne := E{f A (b) :beB}. Then the following property holds: 

(BASIC) For all D C ß c A C J7 such that 36 G B./(6) > f A (D) = f A (B) * / B (D). 

Proof 

For X C y C (7 such that 3y G Yl/fo) > we have /y(X) := : i e X} = g/S^y? • 

□ 



We have the following fact for /i generated by a relation: 
Fact 3.2.3 

Let U be a finite preferential strueture such that for AQU (J,(Ä) = A = 0. 
Thon 17 is ranked iff (BASIC) as defined in Fact GHH (page H2J) holds for f A . 

Proof 

Lct D C S C A C [/, S ^ 0. 

Case 1: D n = 0. Then f A (D) = 0. 

Case 1.1: If B n /x(A) = 0, then f A (B) = 0, and we are done. 

Case 1.2: Let £ n ß(A) ^ 0. If D n ^(5) = 0, then = 0, and we are done. Suppose 

D n jti(S) ^ 0, so there is d G D n m(-B), so t! e D n /i(i) by ß n ^ and rankedness, so 
f A (D) 0, contradiction. 

Case 2: Dn p{A) ^ 0. 

Thus, byDCß.ßn ß(A) ± 0, and by rankedness p(B) = B n So by D C ß again, D n 

MA) =ßn(ßn M (i)) = D n By definition, f A (B) := «^ggf , := «ff^ffl ■ 

f (r\\ _ card{p.(B)r\D) rp-i cqrd(/j(4)n 13) _ card(/j(A)ni3) ' " " " 



card(fj.(B)) ' ' card(^( J 4)) card(//(A)) card(fj,(B)) ' 

"<(=": 

Then there are a, 6, c G J7, where a is incomparable to &, and & -< c but a / c, or c &, but c a. 
We have four possible cases. 

Let, in all cases, A :— {a, b, c}. We construet a contradiction to (BASIC). 
Case 1, b -< c : 

Case 1.1, a is incomparable to c : Consider Z? := {a,c}, D := {a}. Then f A (D) — |, f A (B) = |, 

M£>) = i 

Case 1.2, c -< a (so -< is not transitive): Consider Z? := {a, &}, D := {a}. Then f A (D) = 0, 



f A (B) = 1, f B (D) = \. 
Case 2, c -< b : 

Case 2.1, a is incomparable to c : 

Consider B := {a,b}, D := {a}. Then / A (Z>) = ±, = §, / s (£>) = §. 

Case 2.2, a -< c - similar to Case 1.2. 

□ 



Remark 3.2.4 

Note that sets A C B, where ß(B) n A = 0, and sets where P(A) = have a similar, exceptional 
role. This might still be important. 

3.2.2 Set independence 

We interpret independence here diffcrcntly, but in a related way, as prepared in Section f3. 1.31 (page 
HU). 

Definition 3.2.4 

We consider function sets E etc. over a fixed, arbitrary domain / ^ 0, into some fixed codomain 
K. 

(1) 

For pairwise disjoint subsets X, Y, Z of /, we define 

(X | Y | Z) s iff for all f,g &T, such that / \ Y = g \ Y, there is h e E such that /i f X = / f X, 
h\Y = f \Y = g\Y, h\Z = g\Z. 

Recall from Section 13.1.31 (page [12^1 that we call this notion set independence. 
Y may be empty, then the condition / \ Y = g \ Y is void. 

Note that nothing is said about I — (X öY ö Z), so we look at the projection of U to X U Y U Z. 
When y = 0, we will also write (X \ \ Z)^. 

(X | y | Z)s means thus, that we can piece functions together, or that we have a sort of de- 
composition of E into a product. This is an independence property, we can put parts together 
independently. 

(2) 

In the sequel, we will just write (...) for (. . .)s when the meaning is clear from the context. 

Recall that Example 13.1.51 (page [TSl compares different forms of independence, the probabilistic 
and the set variant. 

Obviously, we can generalize the equivalence results for probabilistic and set independence for 
X x Z and X x Y x Z to the general Situation with W in Section [3. 1 .31 (page [T2l . as long as we do 
not consider the füll functions er, but only their restrictions to X, Y, Z, a \ (X U Y U Z) . As we will 
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stop the discussion of probablistic independece here, and restrict ourselves to set independence, 
this is left as an easy exercise to the reader. 



Heiteres 

3.3 Basic results for set independence 
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Notation 3.3.1 

In more complicated cases, we will often write ABC for (A \ B \ C), and -^ABC or — ABC if 
(A | B | C) does not hold. Moreover, we will often just write f(A) for / f A, etc. 

For (A U A' | B \ C), we will then write (AA')ßC, etc. 

If only singletons are involved, we will sometimes write abc instead of ABC, etc. 

When we speak about fragments of funetions, we will often write just A : a for a \ A, B : o = r 
for a \ B = t \ B, etc. 

We use the following notations for funetions: 

Definition 3.3.1 

The constant funetions C and l c : 
c (i) = for alH e I 
l c (i) = 1 for alH e I 

Moreover, when we dehne a funetion a : I — > {0, 1} argument by argument, we abbreviate a(a) = 
by a — 0, etc. 

Sometimes, we also give (a fragment of) a funetion just by the sequence of the values, so instead 
of writing a = 0, b = 1, c = 1, we just write 011 - context will disambiguate. 

Remark 3.3.1 

This remark gives an intuitive justification of (some of) above rules in our context. 
Rule (a) is trivial. 

It is easiest to set Y := to see the intuitive meaning. 

Rule (b) is a trivial consequence. If we can combine longer sequences, then we can combine shorter, 
too. 

Rule (c) is again a trivial consequence. If we can combine arbitrary sequences, then we can also 
combine those which agree already on some part. 

Rule (d) is the most interesting one, it says when we may combine longer sequences. Having just 
(X 1 1 Z) and {X \ \ W) as prerequisite does not sufhee, as we might lose when applying (X \ \ W) 
what we had already by (X \ \ Z). The condition (X \ Z W) guarantees that we do not lose this. 

In our context, it means the following: 

We want to combine a \ X with r \ Z U W. By (X 1 1 Z) , we can combine a \ X with r \ Z. Fix 
p such that p \ X = a \ X, p \ Z = r \ Z. As p \ Z = r \ Z, by (X \ Z \ W), we can combine 
p \ X U Z with r \ W, and have the result. 

Note that we change the funetions here, too: we start with a, r, then continue with p, t. 
We can use what we construeted already as a sort of scaffolding for construeting the rest. 



Fact 3.3.2 
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Zusammenhang (X | Y \ Z) mit Produkten. 



Proof 

Do 

□ 

We show now that above Rules (a) — (d) hold in our context, but (e) does not hold. 

Fact 3.3.3 

In our Interpretation, 

(1) rule (e) does not hold, 

(2) all (X | Y | 0) (and thus also all (0 | Y | Z)) hold. 

(3) rules (a) — (d) hold, even when one or both of the outside elements of the tripels is the empty 
set. 

Proof 

(1) (e) does not hold: 

Consider / := {x, y, z, w} and U := {Uli, 0100}. Then x{yw)z and x{yz)w 1 as for all a \ yw there 
is just one r this u can be. The same holds for x(yz)w. But for y = 1, there are two different paths 
through y = 1, which cannot be combined. 

(2) This is a trivial consequence of the fact that {/ : / : — > U} = {0}. 

(3) Rules (a), (b), (c) are trivial, by definition, also for X, Z = 0. In (c), if W = 0, there is nothing 
to show. 

Rule (d) : The cases for X,W, Z = are trivial. Assume a, r such that a \ Y = r \ Y, we 
want to combine er \ X with t \ Z U W. By (X \ Y \ Z) , there is p such that p \ X = a \ X, 
p \ Y = a \ Y = t \ Y, a \ X = p \ Z = t \ Z. Thus p and r satisfy the prerequisite of 
(X | Y U Z | W), and there is a such that atX = ( otX = crfX, afX = ( orr = crfy = rfr, 
a f W = t \ W. 

□ 

Next, we give examples which shows that increasing the center set can change validity of the tripcl 
in any way. 

Example 3.3.1 

(1) 

This example shows that neither (X \ Y \ Z) implics (X || Z), nor, conversely, (X \\ Z) implies 
(X\Y\Z). 

Consider / := {x, y, z}. 



(1.1) Let U := {(0,0,0), (1,1,1), (0,1,0), (1,0,1), (1,1,0), (0,0,1)}. Then {x \\ z), as all combina- 
tions for x and y exist, i.e. paths with the projections (0,0), (0,1), (1,0), (1,1). Fix, e.g., y = 1. 
Thon the paths through y = 1 are (1,1,1), (0,1,0), (1,1,0), but (0,1,1) is missing. So (x \ y \ z) 
does not hold. 

(1.2) Let U := {(0,0,0), (1,1,1)}. Then {x \ \ z) trivially fails, but (x \ y \ z) holds. 
(2) 

Consider / := {x, a, b, c, d, z}. 

Let S := {111111, 011110, 011101, 111100, 110111, 010000}. 
Then —ix(abcd)z, x(abc)z, ->x(ab)z. 

For -nx(abcd)z, fix abcd = 1111, then 111111,011110 g E, but, e.g., 011111 g E. 

For x(abc)z, the following combinations of abc exist: 111, 101, 100. The result is trivial for 101 and 
100. For 111, all combinations for x and z with and 1 exist. 

For -nx(ab)z, fix ab = 10, then 110111,010000 S E, but there is, e.g., no HOxyO £ E. 

See Diagram 13.3.11 (page [25]) 

□ 



Diagram 3.3.1 
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add paths different on ab, singletons on c, so they don't disturb on abc: 
seen on abc, the added paths are singletons, so they respect automatically 
(x | abc | z) 
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3.3.1 Example of a rule derived from the basic rules 

We will use the following definition. 

Definition 3.3.2 

Given £ as above, set 

£ M := {(X, Y, Z) : X,Y,Z are pairwise disjoint subsets of I, (X \ Y \ Z) E, but for all X' C X 
and all Z' C Z (X' \ Y \ Z) G E and (X | F | Z') GS}. 

We will sometimes write (X, X' | Y \ Z) etc. for (X U X> \ Y \ Z). 

When we write (X, X' \ Y \ Z) etc., we will tacitly assume that all sets X, X',Y, Z are pairwise 
disjoint. 

Remark 3.3.4 

(1) E M contain thus the minimal X and Z for fixed Y, such that (X \ Y \ Z) £ S. 

(2) By rule (b), for all (X \ Y | Z) G S, there is (X', Y, Z') EZ ß X C X', Z C Z' , unless all a, r 
such that <r \ Y = t \ Y can be combined. 

As the cases can become a bit complicated, it is important to develop a good intuition and rep- 
resentation of the problem. We do this now in the proof of the following fact, where we use the 
result we want to prove to guide our intuition. 

Fact 3.3.5 

Let E be closed under rules (a) - (d). Then, if (X,X',X" | Y | Z,Z',Z") G £„, then (X,Z' 
X', Y, Z" | X", Z) g E. 



Proof 



Diagram 3.3.2 
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X X 1 X" Y Z Z' Z' 



&X &x> Ox" &Y = Ty Tz Tz> Tz" 



(1) 

(2) 
(3) 
(4) 
(5) 
(6) 



Prerequisite: ax 1 = Tx> , &Y = Ty , oz" = tz>> 



The upper line is the final aim. Line (1) expresses that we can combine all parts except sx, by 
(X',X" | Y | Z,Z',Z"), which holds by (X,X',X" \ Y \ Z,Z',Z") e S^, by similar arguments, 
we can combine as indicated in lines (2) — (6). We now assume (X, Z' \ X', Y, Z" \ X", Z) 6 S. So 
we have to look at fragments, which agree on X', Y, Z". This is, for instance, true for (1) and (3). 

We turn this argument now into a formal proof: 

Assume 

(A) (X, Z' | X', Y, Z" | X", Z) G S, and 

(B) (X,X',X"\Y\Z,Z',Z")G^. 

(C) (X,X' \ Y\Z, Z', Z") by (B), see line (3) 

(D) (X | X', y, Z', Z" | X", Z) by (A) and rule (c) 

(E) (X | X', Y | Z, Z', Z") by (C) and rule (c) 

(F) (X\X',Y\Z',Z")by(E) & nd (b) 

(G) (X \X',Y\ X", Z, Z', Z") by (D) and (F) and (d) 
(K) (X | X',X",Y\ Z,Z',Z") by (G) and (c) 

(L) {X',X" | Y | Z,Z',Z") by (B), see line (1) 



(M) (Z,Z',Z" | X',X",Y | A) by (K) and (a) 

(N) (Z,Z',Z" | Y | X',X") by (L) and (a) 

(O) (Z,Z',Z" | Y | X,X',X") by (M) and (N) and (d) 

(P) (X,X',X" | Y | Z,Z',Z") by (O) and (a). 

So we conclude (X, X', X" \ Y \ Z, Z' , Z") G E, a contradiction. 

Comment: 

We first move Z', Z" to the right, and then X' ', X" to the left. 
Moving Z',Z" : 

We use X" (or Z) on the right, which not be changed, therefore we can use line (3), resulting in 
(C) (X,X' | Y | Z,Z',Z"), or, directly 

(C) (X,X'\Y \Z',Z"), again by E M , 
which is modified to 

(F) (X | X',Y | Z', Z"), so we have on the right Z', Z" which we want to move. 
We put Z' in the middle (Z" is there already) of (A), resulting in 

(D) (X | X',Y,Z',Z" | X",Z). 

Now we can apply (d) to (D) and (F), and have moved Z', Z" to the right: 

(G) {X \X',Y\ X",Z,Z',Z"). 

We still have to move X' and X" to the left of (G), and do this in an analogous way. 
□ 

Note that our results stays valid, if some of the X', X" , Z', Z" are empty. 
Aber resultat darf nicht links oder rechts sein. 

Corollary 3.3.6 

Let E be closed under rules (a) - (d). Then, if (X,X',X" \ Y,Y',Y" \ Z,Z',Z") G E M , then 
(X, Y', Z' | X', Y, Z" | X", Y", Z) E. 

Thus, if, for given Y U Y' U Y", (X, X', X" \Y,Y', Y" \ Z,Z', Z") £ E M! then for no distribution 
of A U A' U X" U y U Y' U F" U Z U Z' U Z" such that the outward elements are non-empty, 
(A, Y', Z' | X', Y, Z" | X", Y", Z) G E. 

Proof 

Suppose (A,y',Z' | A',y,Z" | A",y",Z) G E. Then by rule (c) (A, Z' \ X',Y,Y',Y",Z" \ 
X", Z) G E. Set Y 1 := Y U Y' U F". Then (A, Z' | A', y, Z" | A", Z> G E, and (A, X', X" | Y 1 \ 
Z, Z', Z") G E M , contradicting Fact [3X5l (page [27]) . □ 
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Validity of ABC, ACD, ADE, ABB ABB 




A 


B 


C* 


D 


E 






er 


er = t 






T 


ABE? 


(1) Pi 


er 


er = r 


T 






ABC 


(2) 92 


er 




Pl=T 


T 




ACD 


(3) P3 


er 






Pl=T 


T 


ADE 


(4) PA 


er 


er = r 






P3=T 


AEB 



3.4 Examples of new rules 
3.4.1 New rules 

Above rules (a) — (d) are not the only ones to hold, and we introduce now more complicated ones, 
and show that they hold in our Situation. Of the possibly infinitary rules, only (Loopl) is given 
in füll generality, (Loop2) is only given to illustrate that even the infinitary rule (Loopl) is not all 
there is. 

For warming up, we consider the following short version of (Loopl): 
Example 3.4.1 

ABC, ACD, ADE, AEB ABE. 
We show that this rule holds in all S. 

Suppose A : er, B : er = r, C : r, so by ABC, there is p\ such that 

A : pi = er, B : p\ = er = r, C : p± = t. So by ACD, there is pi such that 

A : p2 — er, C : p2 = pi = t, D : p2 — t. So by ADE, there is pz such that 

A : p3 = er, D : p$ = p2 = T, E : p^ — t. So by AEB, there is p^ such that 

A : pi — er, E : p 4 = p 3 = r, B : p 4 = r = er. 

So ABE. 

We abbreviate this reasoning by: 

(1) ABC : A : a, B : a = t, C : r 

(2) ACD : (1) + t 

(3) ADE : (2) + r 

(4) AEB : (3) + r 
So ABE. 

It is helpful to draw a little diagram as in the following Table |3T4~T1 fpage |3"Ü)) . 

We introduce now some new rules. 
Definition 3.4.1 
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• (Binl) 

XYZ,XY'Z,Y(XZ)Y' => X(YY')Z 

• (Bin2) 

XYZ,XZY',Y(XZ)Y' =► X(YY')Z 

• (Loopl) 

AB1B2, . . • , ABi-iBi, AB{Bi + i, ABi + \Bi + 2, ■ ■ ■ , AB n _\B n , AB n B\ => AB\B n 
so we turn AB n B\ around to AB\B n . 

When we have to be more precise, we will denote this condition (Loopl n ) to fix the length. 

• (Loop2) 

ABC, ACD, DAE, DEF, FDG, FGH, HFB HBF : 

The complicated strueture of these rules suggests already that the ternary relations are not the 
right level of abstraction to speak about construetion of funetions from fragments. This is made 
formal by our main result below, which shows that there is no finite characterization by such 
relations. In other words, the main things happen behind the screen. 

Fact 3.4.1 

The new rules are valid in our Situation. 

Proof 

• (Binl) 

(1) XYZ : X : a, Y : a = r, Z : t 

(2) XY'Z : X :a,Y' : a = t, Z : t 

(3) Y(XZ)Y' : (1) + (2) 
So X(YY')Z. 

• (Bin2) 

Let X : er, Y : er = r, V : er = r, Z : r 

(1) XYZ : X : a, Y : a = t, Z : t 

(2) XZY' : (1) +t 

(3) Y(XZ)Y' : (1) + (2) 
So X(YY')Z. 

• (Loopl) 

(1) AB 1 B 2 : A:a, B 1 :a = T, B 2 :t 

(2) AB 2 B 3 : (1) + r 



(i-1) ABi^Bt : (i-2) + T 
(i) AB i B i+1 : (i - 1) + r 
(i + 1) AB 4+1 ß t+2 : (»)+ T 

(n-1) AB n _iB n : ( n -2)+T 
(n) AB n Bi : ( n -l)+T 
So i4BiB„. 

• (Loop2) 
Let 

(1) ABC : A : er, ß : er = r, C : r 

(2) ACD : 1 + r 

(3) DAS : 2 + er 

(4) DEF : 3 + er 

(5) FDG : 4 + t 

(6) FG-ff : 5 + t 

(7) HFB : 6 + er 

So ff BF by B : <r = r. 

Note that we use here B : a = t, E : a = t, H : <t = t, whereas the other tripels are used for other 
funetions. 

□ 



Next we show that the füll (Loopl) cannot be derived from the basic rules (o) — (d) and (Binl), 
and shorter versions of (Loopl). (This is also a consequence of the sequel, but we want to point it 
out right away.) 

Fact 3.4.2 

Let n > 1, then (Loop\ n ) docs not follow from the rules (a) — (d), (0), (Binl), and the shorter 
versions of (Loopl) 

Proof 

Consider the following set of tripels LU L' over / := {a, b\, . . . , b n } : 
L := {aM 2 , ■ • ■ , ahh+i, • • • , ab n -ib n , a6„&i}, 
V := {$AB : An B = ß, AU B C I}, 

and close this set under symmetry (rule (a)). Call the resulting set A. 

Note that, on the outside, we have or singletons, inside singletons or 0. If the inside is 0, one of 
the outside sets must also be 0. 
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When we look at L, and define a relation < by x < y iff axy G L, we see that the only <-loop is 
bx <b 2 < ... <b n < fei. 

We show first that A is closed undcr rulcs (a) — (d) (see Definition 13.2.31 (page fTÜ)) ). 

(a) is trivial. 

(b) If W = or Z = 0, this is trivial, if W = Z, this is trivial, too. 

(c) If Z U W = 0, this is trivial, if Z U W is a singleton, so Z = or = or Z = W. Z = or 
W = $ are trivial, otherwise Z = contradicts disjointness. 

(d) Z = is trivial, so is W = 0, otherwise Z = W contradicts disjointness. 

(Binl) X = or Z = are trivial, otherwise X — Z is excluded by disjointness. So we are in L' 
for Y{XZ)Y'. So Y = or Y' = and it is trivial. 

Obviously, (Loopl n ) does not hold. 

We show now that all (Looplk), < k < n hold. 

The cases n = 1, n = 2 are trivial. 

Consider the case 2 < k < n. 

This has the form AB 1 B 2 , AB 2 B 3 , AB k - X B k , AB k B x AB x B k . 
If A = or ßfe = 0, the condition holds. 

So assume A,B k ^ 0. Thus, by above remark, descending to B k _\ etc., we see that all Bi ^ 0, 
1 < % < k. Thus, all prerequisites are in L. Moreover, A has to be a, which is the only elemcnt 
occuring repeatedly on the outside. Consider now the relation <' defined by U <' V iff AUV is 
among the prerequisites. We then have B\ <' B 2 <' ... <' B k <' B\, where all Bi are some bj, 
we see that the resulting <'-loop is too short, so the prerequisites cannot hold, and we have a 
contradiction. 

□ 



3.5 There is no finite characterization 

We turn to our main result. 
3.5.1 Discussion 

Consider the following simple, short, loop for illustration: 

ABC, ACD, ADE, AEF, AFG, AGB ABG - so we can turn AGB around to ABG. 
Of course, this construction may be arbitrarily long. 

The idea is now to make ABG false, and, to make it coherent, to make one of the interior conditions 
false, too, say ADE. We describe this Situation fully, i.e. enumerate all conditions which hold in 
such a Situation. If we make now ADE true again, we know this is not valid, so any (finite) 
characterization must say "NO" to this. But as it is finite, it cannot describe all the interior tripels 
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of the type ADE in a sufficiently long loop, so we just change one of them which it does not "see" 
to FALSE, and it must give the same answer NO, so this fails. 

Basically, we cannot describe parts of the loop, as the < 1 1 >-language is not rieh enough to express 
it, we see only the final outcome. 

The problem is to fully describe the Situation. 

3.5.2 Composition of layers 

A very hclpful fact is the following: 

Definition 3.5.1 

Let T,j be funetion sets over I into some set K, j G J. 

Let E := { / : I -> K J : f[i) = {</,•(*), j) : j G J, / ; G \,\ }. 

So any / G E has the form f(i) = / 2 (i), . . . , /„(i)), f m G E TO (we may assume J to be 

finite). 

Thus, given / G E, / m G E m is defined. 
Fact 3.5.1 

For the above E (A \ B \ C) holds iff it holds for all Ej. 

Thus, we can destroy the (A \ B \ C) independently, and collect the results. 

Proof 

The proof is trivial, and a direct consequence of the fact that / = /' iff for all components fj = /j. 
Suppose for some E^, k G J, ->(A \ B \ C). 

So for this k there are fk,f'k G such that f/t(i?) = f' k (B), but there is no fj! G Efe such that 
ft{A) - / fc (A), /^(ß) = f k (B) = f k (B), f k '(C) - f' k (C) (or conversely). Considcr now some 
h G E such that /i^ = J^, and h' is like ft, but h' k = f k , so also h! G E. Then = h'(B), but 

there is no h" G E such that h"(A) = h(A), h"(B) = h(B) = h'(B), h"(C) = h'(C). 

Conversely, suppose (A \ B \ C) for all Ej. Let h,h' G E such that h(B) = h'(B), so for all 
j G J hj(B) = h'^B), where hj G E^, b!- G Ej, so there are fe" G E^ with /i" (A) = 
^'(ß) = hj(B) = h'j(B), h'^{C) = h'^C) for all j G J. Thus, /i" composed of the h'j is in E, and 
h"(Ä) = h(A), h"(B) = h{B) = h'(B), h"{C) = h'(C). 

□ 



3.5.3 Systematic construetion 

Recall the general form of (Loopl) for singletons: 

a&iö 2 , • • • , abi-ibi, abib i+ i, ab i+ ib i+2 , ■ . . ,ab n -ib n ,ab n bi => abib n 
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We will fully describe a model of above tripels, with the exception of ab\b n and abib i+ i which will 
be made to fail, and all other (X \ Y \ Z) which are not in above list of tripels to preserve, will 
fail, too (except for X = or Z = 0). 

Thus, the tripels to preserve are: 

P := {abib 2 , . ■ . , abi-ih, (BUT NOT a&A+i) , ab i+1 b i+2 , ab n -\b n , ab n bi} 
We use the following fact: 

Fact 3.5.2 

Let X C I, card(X) > 1, S x := { a : I ->• {0, 1} : card{x G X : (j(ar) = 0} is even } 
Then -nABC iff A n X ^ 0, Cnl^f), ICiUßUC. 

Proof 

Suppose Anx ^<D,Cnx ^=<D, X CAUBUC. 

Take er such that card{x G X : cr(:r) = 0} is odd, then er Sx • As X % AuB, there is t e T,x such 
that er fiUß = r \ A U ß. As X % B U C, there is p G such that pf.BuC = crf.BuC. Thus, 
r \ B = p \ B. If there were a G such that a fiUß = r fiUß anda fßUC = p fßUC, 
then a \ A U B U C = er [AUBUC, contradiction 

Suppose AnX = 0orCnX = 0, orl^UßUC. We show ABC. 

Case 1: C n X = 0. Let er, r G Sx such that er f _B = r f ß. As C n X = 0, we can continue 
er f i4 U ß as we like. 

Case 2, iflX = fl, analogous. 

Case 3: I^iUßUC. But then there is no restriction in AU BöC. 

□ 

We will have to make ab\b n false, but ab n b\ true. On the other hand, we will make ab\b 3 false, 
but 06361 need not be preserved. 

This leads to the following definition, which helps to put Order into the cases. 
Definition 3.5.2 

Suppose we have to destroy axy. Then 

dmin(axy) := min{d({a,x,y},{a,u,v}) : auv has to be preserved } - d the counting Hamming 
distance. 

Thus, dmin(abib n ) = (as ab n b\ has to be preserved), dmin(abib 3 ) = 1 (because a6i6 2 has to be 
preserved, but not 06361). 

We introduce the following order defined from the loop prerequisites to be preserved. 



Definition 3.5.3 

Order the elements by following the string of sequences to be preserved as follows: 
h+i < h+2 -<•••-< 6 n - 1 -< b n ■< h. -< ö 2 -< • • • -< h-i < h 

Note that the interruption at abibi+i is crucial here - otherwise, there would be a cycle. 
As usual, ^ will stand for -< or = . 



3.5.4 The cases to consider 

The elements to consider are: a, 61, . . . , b n . 
Recall that the tripels to preserve are: 

P := {ab x b 2 , a&i-i&», (BUT NOT abib l+1 ) , ab l+1 b l+2 , ab n _ib n , a6 n 6i} 
The (X | Y | Z) to destroy are (except when X = or Z = 0) : 

(1) all {X\\Z) 

(2) all (X \ Y \Z) such that X UY U Z has > 3 Clements 

(3) all tripels which do not have a on the outside, e.g. bgc 

(4) and the following tripels: 

(the (0) will be explained below - for the moment, just ignore it) 
a6i& 3 , . . . ,aöi6„_i, ab x b n (0) 
ab 2 b\ (0), a& 2 &4, ■ ■ ■ , ab 2 b n 
a6 3 6i, a6 3 6 2 (0), 06365, . . . ,a6 3 6„ 

abibi, abib 2 , ALSO a&j&j+i, . . . , a6i6„ 

a6„_ 2 6i, , . . . , a&n-2&n-3 a b n - 2 b n 

ab n _ibi, , . . . , a6„„ 1 6„_2 (0), 
a6 n 6i, , . . . ,a6 n 6„_! (0) 

3.5.5 Solution of the cases 

We show how to destroy all tripels mentioned above, while preserving all tripels in P. 

(1) all (X\Y\Z) whcre X U Y U Z has > 3 elements: 

See Fact 13.5.21 (page[35 | with the X there with 4 elements, for all such X, Y, Z separately, so 
all tripels in P are preserved. 

(2) all (X \ Y \ Z) with 1 element: - 



(3) all (X || Z) : 

This can be done by considering Sj := {0 C , l c }. Then, say for et, c, we have to examine the 
fragments 00 and 11, but there is no 10 or 01. For (a \ b | c) this is no problem, as we have 
only the two 000, 111, which do not agree on b. 

(4) all (X | Y | Z) with 2 elements: eliminated by (X 1 1 Z) 

(5) all (X \ Y \ Z) with 3 elements: 

(5.1) a is not on the outside 

(5.1.1) a is in the middle, we need -*xay : Consider S with 2 functions, C , and the second 
defined by a = 0, and all u = 1 for u ^ a. Obviously, ^xay. Recall that all tripels to 
be preserved have a on the outside, and some other element x in the middle. Then 
the two functions are different on x. 

(5.1.2) a is not in xyz, we need ->xyz : Consider £ with 2 functions, C , and the second 
defined by a = y = 0, all u = 1 for u ^ a, u ^ y. As a is neither a; nor z, ->xyz. If 
some uvw has a on the outside, say u — a, then both functions are 000 or Ovw on 
this tripel, so uvw holds. 

(5.2) a is on the outside, we destroy ayz : 

(5.2.1) Case dmin(ayz) > 0: 

Take as S the set of all functions with values in {0, 1}, but eliminate thosc with 
a = y = z = 0. Then -^ayz (we have 100, 001, 101, but not 000), but for all auv with 
d({a, y, z}, {a, u, v}) > auv has all possible combinations, as all combinations for 
ay and az exist. 

(5.2.2) Case dmin(ayz) = 0. 

The elements with dmin — are: 

abib n , ab 2 bi, a6A-i, NOT ab l+1 b t , ab l+2 b i+ i. a6„_i&„_ 2 , ab n b n -i, they 
were marked with (0) above. 

£ will again have 2 functions, the first is always C . 
The second funetion: Always set a = 1. 

We see that the tripels with dmin = to be destroyed have the form ayz, where z 
is the immediate ^-predecessor of y in above order - see Definition l3.5.3l (page[3ü]). 
Conversely, those to be preserved (in P) have the form azy, where again z is the 
immediate -<-predecessor of y. 

We set z' = 1 for all z' < z, and y' — for all y' y y. Recall that z -< y, so we have 
the picture bi+i = 1, . . . , z = 1, y = 0, . . . , 6j = 0. 

Then —>ayz, as we have the fragments 000, 101. But azy, as we have the fragments 
000, 110. Moreover, considering the successors of the sequence, we give the values 
11, or 10, or 00. This results in the funetion fragments for auv as 111, or 110, or 100. 
But the resulting fragment sets (together with C ) are then: {000, 111}, {000, 110}, 
{000, 100}. They all make auv true. Thus, all tripels in P are preserved. 
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3.6 Systematic construetion of new rules 



This section is an outline - not a formal proof - for construeting a complete rule set for our scenario. 

We give here a general way how to construet new rules of the type ABC, DEF, =£> XYZ which 

are valid in our Situation. 

3.6.1 Consequences of a Single tripel 

Let (XX' X")Y (Z Z' Z") be a tripel, then all consequences of this Single tripel have the form 
X(X'YZ')Z (up to symmetry). 

Obviously, such X(X'YZ')Z are consequences, using rules (b) and (c). 

We now give counterexamples to other forms, to show that they are not consequences in our setting. 
We always assume that the outside is not 0. We consider A = B = C = {0, 1}, and subsets of 
A x B x C. 

(1) Y decreases: 

Consider {000, 111}, then ABC, but not A0C. 

(2) Z increases: 

Consider {000, 101}, then A$B, but not A$(BC). 

(3) X goes from left to right: 

Consider {000, 110}, then (AB)C, but not A(BC) 

(4) Y increases by some arbitrary W : 

Consider {000, 101, 110, 011}, then A0C, but not ABC. 

3.6.2 Construetion of funetion trees 

We can construet new funetions from two old funetions using tripels ABC, so, in a more general 
way, we have a binary funetion construetion tree, where the old funetions are the leaves, and the 
new funetion is the root. The form of such a tree is obvious, the tripels used are either directly 
given, or consequences of such tripels. In Examplc l3.6.3l (pagel49 p . for instance, in the construetion 
of p2, we used ACD, but we could also have used e.g. AC(DD'), for some D' . 

3.6.3 Derivation trees 

Not all such funetion construetion trees are proof trees for a rule T\ , . . . , T n => T, where the Tj 
and T are tripels. 

We have to look at the logical strueture of the tripels to see what we need. In order to show 
T = ABC, we assume given two arbitrary funetions er and r, which agree on B, and construet p 
such that on A p — er, on B p = a = t (the latter, a = r by prerequisite) , and on C p — t. We 
will write this as A : p = er, B : p = a = t, C : p = t. 
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Thus, we have no functions at the beginning, except er and r, so all leaves in a proof tree for 
Ti, . . . , T n => T have to be er or r. Moreoever, all we know about er and r is that they agree on 
B. Thus, we can only use some T[ = A'B'C on er and r if B 1 C _B. Likewise, in the interior of 
the tree, we can only use er \ B = r \ B, and, of course, all equalities which hold be construction. 
E.g., in Examplc 13.6.31 (page |49|. in the construction of p2, by construction of p\, C : p\ — r, so 
we can use ACD to construct P2 from p\ and r. 

At the root, we must have a function p of the form A : p — a, B : p = a = r, C : p = t. In Example 
I3.6.3l fpage l49l) . pi, at the root, was constructed using AEB from p^ and r. But we do not interpret 
P4 as AEB, but as ABE, which is possible, as A : p^ — er, B : p± — er — r, E : p^ = t. 

Intermediate nodes can be read as an intermediate result A'B'C by the same criteria: They must 
be functions p' such that A' : p' = a, B' : p' = a = r, C : p' = t and all B" such that B" : er = t 
used up to this node must be subsets of B' , as B' : er = t is the only hypothesis we then have. 

3.6.4 Universal trees 

3.6.4.1 A proof for XYZ 

The following is a universal proof for XYZ: 
It is a binary tree, whose leaves are all / or g. 

It uses as prerequisite only Y : f — g (and equalities constructed on the way). 
It makes /, g, and all other functions as different as possible. 

For instance, in Example fpage H9l . where we show that ABC, ACD, ADE, AEB =S> ABE, 

let us assume all sets A, etc. are singletons, we then set: er = 00000, r = 10111, p\ — 00122, 
P2 = 03113, p3 = 04411, p4 = 00551. So each new node has a new default value (2,3,4,5 here). 

Then we have no chance equalities, but only those we constructed. In particular, if we write the 
equalities with er, r for every pi thus constructed, we can read off the derived equalities. There are 
no others. 

The root of the tree must be a function h, which agrees on X, Y with /, and on Y, Z with g. 

This is a universal proof tree, as it works for any other pair /, g, and any other internally constructed 
Pi, too. 

3.6.4.2 Requirements for a proof for XYZ 

Suppose we have a proof for XYZ. 

We cannot assume we have anything but /, g to start with. 

The proof must be a binary tree, as the proof will be construetive, and we have no other construction 
principles but the combination of 2 functions. 

So it is a binary tree, with leaves /, g. 

It must also work for f,g maximally different, i.e. outside Y, they may be different. It must also 
work for the internal functions pi maximally different. So we can only assume that /, g agree on Y, 
and all other equalities must be by construction. Thus, it must also work for the universal choice 
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as done above. Assume now we have construeted this way h such that h = f on X, Y, h = g on 
Y, Z. This cannot be by coincidence, but it has to be a new funetion, construeted by the tree. 

3.6.4.3 Summary: proofs for XYZ 

To show XYZ, construet all universal trees for XYZ: 

Bcgin with /, g which agree at most on Y, makc them different everywhere eise. 

Make all internal nodes different from each other by enumerating them, and giving their number 
as default values to all other arguments. 

Check if the root can be seen as the construetion of a h s.t. h = f on XY, h = g on YZ. 
If so, we have a proof of XYZ. 

All proofs of XYZ have this form, as they must work for the universal tree. 
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3.6.5 Examples 



Hohen 'S 

Diagram 3.6.1 



Example \3. 6.1\ 



Pi 



P2 



T 



t Examples jU and \3lD\ 



Pi 



P2 



P3 
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Diagram 3.6.2 



Example{3J[M 

T 



pi t pi, using ABC - A : o~, B : a = r, C : t 



P2 t p2, using ACD - A : p\ = er, C : p\ = r, D 



P3 t P3, using ADE - A : pi — a, D : pi = r, E 



Pi, using AEB - A : p$ = er, E : ps = r, B 
Interpretation: ABE, common pari B : a 
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Diagram 3.6.3 



Example \3. 6.5\ 

TG TG 



Pi 



P2 



Pi 
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Explanation: 

By "prerequisite" of pi we mean the set X we used in the construction, where X : er — r. For 
instance, in the construction of p 2 in Example 13.6.11 (pagel48|). we used only that B U C : p\ = t 
by the construction of p\, no additional use of some er — r was made. 

By "common part" of pi we mean the set X such that X : pi = er = r. 
Example 3.6.1 

(Contraction), ABC, A(BC)D -> AB (CD): 
(See Diagram 13.6. II fpage l4"Tj) upper part.) 

• pi : A : er, B : er — t, C : r 

generated by ABC from er, r 
prerequisite B, 
common part: B 

pi can be interpreted as the (trivial) derived tripel ABC 

• p2 : A : pi = er, B : p\ = a — r, C : p\ — r, D : r 
generated by A(BC)D from pi, r 
prerequisite -, 

common part: B. 

pi can be interpreted as a derived tripel by AB(CD). 

pi can also be interpreted as a derived tripel by A(BC)D or A(BD)C. Note that these 
possibilities can be derived from AB(CD) by rule (c), Weak Union. 

Example 3.6.2 

(Binl), XYZ, XY'Z, Y(XZ)Y' => X(rr')^ : 
(See Diagram 13.6. II fpagel4"T j) lower part.) 

• p± : X : er, Y : er = r, Z : r 

generated by from er, r 

prerequisite Y 
common part: Y 

• ,02 : -X" : er, Y' : er = t, Z : r 
generated by XY'Z from er, r 
prerequisite Y' 

common part: Y' 

• p 3 : Y : pi = ct = r, X : /?i = p 2 = er, Z : p x = p 2 = r, Y' : p 2 = er = r 
generated by Y(XZ)Y' from p%, p 2 
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prerequisites - 
common part: YY' 

P3 can be interpreted as a derived tripel by X(YY')Z. 
Example 3.6.3 

(Loopl) ABC, ACD, ADE, AEB => ABE: 
(See Diagram [3X2](page |43]).) 

• pi : A : er, B : er — r, C : r 

generated by ABC from er, r 
prerequisite B 
common part B 

• p 2 : A : pi — er, C : p\ — r, D : r 

generated by ACD from pi, r 
prerequisite - 
common part - 

P2 cannot be interpreted as a derived tripel, as there was a prerequisite used in its derivation 
(B), but the common part in p2 is 0. 

• p3 similar to p2 ■ 

p 3 : A : p 2 — er, D : p 2 — T, E : r 
generated by ADE from p2, t 
prerequisite - 
common part - 

P3 cannot be interpreted as a derived tripel, as there was a prerequisite used in its derivation 
(B), but the common part in p^, is 0. 

• Pi ■ A : p 3 = er, E : p 3 = r, B : a = r 

generated by AEB from p3, r 
prerequisites - 
common part B 

Pi can be interpreted as the common part B contains all prerequisites used in its derivation. 
ABE is the only non-trivial derived tripel. 

Note that we could, e.g., also have replaced ACD by AC'{DC"), where C — C U C", using 
rule (c), Weak Union. 

Example 3.6.4 

BA(CD), DF(CE), (AB)(CD)(EF) => B(ADF)(CE): 
(See Diagram 13.6. II fpage l4T|) lower part.) 

This example shows that we may need an assumption in the interior of the tree (in the construetion 
of p3 , we use D : a = r). 



• pi : A : er = r, B : er, C : r, D : r 

generated by BA(CD) from er, r 
prerequisites ^4 
common part A 

• P2 : C : t, D : a, E : t, F : er = t 
generated by DF(CE) from er, t 
prerequisite -F 

common part F 

• p 3 : A: px = a = t, B : p 3 = a, C : pi = p 2 = t, D : pi = p 2 = o- = t, E : p 2 = t, 
F : p 2 = er = t 

generated by (AB)(CD)(EF) from px, p 2 
prerequisite D 
common part ADF 

So p 3 can be seen as the derived tripel B(ADF)(CE) (but NOT as (AB)(DF)(CE) etc., as 
DF does not contain ADF. 

Example 3.6.5 

(AA')BC, AD(CD'), (AB')C(C'D), (A'B')C(C'D'), (AD)(B'CC')(A'D'), BC(ADD') => 
A(BD)(CD'): 

(See Diagram 13.6.31 fpage l45|) .) 

This example shows that we may need an equality (here a and ß in the construetion of ps) which 
is not related to er and r. Of course, we cannot use it as an assumption, but we know the equality 
by construetion. 

a and ß will not be known, they are fixed, unknown fragments. 

• px : A : er, A' : er, B : er = t, B' : a, C : r 

generated by (AA')BC from er, r 
prerequisites B 
common part B 

• p 2 : A : er, C : r, C" : ß, D : a = r, D' : r 
generated by AD (CD') from er and r 
prerequisite -D 

common part D 

• ps : A : a, B' : a, C : t, C : ß, D : a = t 
generated by (AB')C(C' D) from pi and p 2 
prerequisite - 

common part D 



• p 4 : A' : er, B' : a, C : t, C : ß, D' : r 
Generated by (A'B')C(C'D') from Pl and p 2 
prerequisites - 

common part - 

• p 5 : A : a, A' : a, B' : a, C : r, C : ß, D : r, D' : r 
generated by (AD)(B'CC')(A'D') from p 3 and p A 

prerequisites - (note that equality on B' and C is by construetion of p$ and p^, and not by 
a prerequisite on a and r) 

common part: D 

• pß : A : u, £> : er = r, C : r, £> : er = r, D' : r 
generated by BC(ADD') from pi and ps 
prerequisites - 

common part: _BZ) 

Thus, p 6 may be seen as derived tripel A{BD){CD') 
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Chapter 4 

Subideal cases 



4.1 The problem and the outline of a Solution 
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Diagram 4.1.1 




One of the advantages of defeasible inheritance Systems is the ability to treat subideal cases. 

In the left hand diagram, (see Diagram 14 .1.11 (page 1551) ) . C inherits from B A and A' . This is the 
ideal case. In the right hand diagram, C does not have property A, the direct link C ^ A prevents 
this, but it still inherits A' from B, this is the subideal case. 

When we interpret A by "blond" , A' by "tall" , B by "Swede" , C by a subset of "Swedes" , which are 
not blond, we have the classical dark haired Swedes problem. Even dark haired Swedes should be 
tall. Preferential structures have a problem with this, as they do not say anything about subideal 
cases (where not all properties which hold in the minimal models, are valid). 

Inheritance Systems are modular in the following sense: the conditions which are inherited are 
clearly and separately spelled out, A and A' here. In preferential structures, we have - in principle 
- one tight knot of ideal cases, and no way to separate the different properties - without additional 
machinery. It is this machinery we want to examine here. 
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In inheritance Systems, in principle, all combinations are possible: AAA' AB AC, AAA' AB A ->C, 
. . . , ->A A ->A' A -*B A -iC. We might not mention all, but there is no contradiction to add nodes 
and arrows to make them visible. E.g., we can introduce D to one of the diagrams, with the 
arrows D — > A, D fr A', D — » B, D fr C, etc. So, the nodes code implicitly logically independent 
possibilities, and we use this idea for preferential structures. 

Suppose we have a language p,q,r, and a preferential structure where True |~ p A q A r. Intuitively, 
we want to "decompose" this into 3 rules: prefer p over ->p, q over ->q, r over —<r. Note that we can 
describe fx(True) by p A q A r, but also by the conjunction of 7 rules, excluding all other models one 
by one: ->(pq^r), etc. But these rules are not independent: There are cases with —i(pq-<r)A-i(p-<qr), 
but there is no case with (pq^r) A (p^qr). 

So, the Solutions seems to be, roughly: Find the finest (this exists, see Fact 3.4 in GS09bJ) 
independent factorization fi,...,f n describing ß(X), and for X' C X with X' n ß(X) — (when 
X' n ß(X) ^ 0, preferential structures take care of this), apply as many of the fi to X' as possible. 
The "as many" should probably be determined by the subset relation, and not by counting, as it 
is not sure that we are prepared to compensate the failure of one fi by the validity of another /;< . 

So, we "know" how to inherit properties to subideal cases in preferential structures. 

Another basic idea of inheritance Systems is specificity: Conflicts are, if possible, solved by speci- 
ficity. Tweety the penguin inherits egg-laying from birds, but not-flying from penguins, and not 
Aying from birds, as penguins are more specific than birds. We have to carry this over to our 
approach to preferential structures. The general Situation is as follows: We have a set X, and 
inherit from Y\, ... ,Y n factors fi t i, . . . , /i, mi , . . . , f n ,i, ■ ■ ■ , /i,m„, where the fij are the factors of 
jtx(ii). The Yi are partially ordered, and it seems natural to do some "merger" of the fcj, respecting 
priority determined by specificity. It is probably adequate to take an "axiom based" approach, 
taking a suitable subset of the fij, as formalisms Coming up with some compromise (e.g. deter- 
mined by some distance between models) are not only different from the inheritance formalism, 
but will probably give unexpected results. 

The Situation is more complicated than in inheritance, as the fij need not be independent when 
considering different i's. Some approach like the following is probably reasonable: 

(1) Consider the strongest Yi, ordered by specificity, and their 

(2) Consider all fij for those Yi. Identify minimal inconsistent sets of those foj, and erase all 
fij involved (this corresponds to direct seepticism in inheritance), until a consistent set of fij is 
obtained. 

(3) Consider the next strongest Y^, and add similar to step (2) new fi'j, while preserving the fij 
already chosen, and considering consistency together with the fij already chosen. 

(4) Etc., until all Yi are done with. 

4.2 Comments 

(1) We have here essentially a multi-valued approach. Not only classical validity as maximally 
strong, and preferential structure as next strongest, but, partially ordered by specificity, 
arbitrarily many levels of strength. 

(2) Note that preferential structures take care automatically of specificity for the ideal case, 
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basically, as we can handle all sets independently. Here, we have to add a formalism to 
handle specificity. 

(3) Higher preferential structures, see |GS08f] . can code our approach, but it is not sure that the 
coding would be natural. 

(4) Independence as discussed above might need to be refined. For instance, we might consider 
independence inside X when considering ß(X). 

(5) We can also ask whether we should not perhaps consider independence of X — instead 
of independence of ß(X). The following example gives an answer: 

Example 4.2.1 

Consider the language {p, q}. 

(1) Let ß(pq, ^pq,p^q 7 ~^p^q) — pq- We have two rules, p < -<p, q < -<q, and apply both. 

(2) ßipq, ~^pq,p^q, -ip~>Q) — (pq, ->pq,p->q), we avoid ~ip~>Q. But we do not avoid ->p and -<q, 
the rule is to avoid one of them. This does not seem to be such a good rule. In particular, 
factorization as above does not work, contrary to the Symmetrie case (1). 

(6) Note that we can see the factorization of fi(X) as an approximation of the ideal case ß(X) 
by a set of rules. 



Chapter 5 

Coding graphs by multisets 



5.1 Introduction 

This is a short comment on [Ä"GS09 . 

We examine here the coding of graphs by sets and multisets. 

In the following, we abbreviate a set of labels or elements like {a, b, c} by abc, etc. 

5.2 Even the case with simple (not multi) sets is quite 
complicated 

We consider here graphs generated by a subset of some powerset (with the natural ordering by 
inclusion) , and show that we need a certain number of atomic labels to represent them. 

The examples show that it is probably quite difficult to come up with a minimal number of elements 
- let alone working with multisets. Example 15.2.11 (page I57|) shows how complicated things can 
become. There is an interplay of chains up and down, and antichains involved. 

Thus, I am quite sceptical about a good Solution to the problem. 

Moreover, Example 15.3.11 fpage l55|) shows that an inductive construction is impossible. 
Example 5.2.1 

Consider the graph generated by the subset {abcdef,abcd,abc,ab,a,bcef,cef,ef,e,f} of 
P(abcdef). 

(A label of the node corresponding to) abcd has to have at least 4 elements, by a -< ab -< abc ~< abcd, 
and e, / have to have at least 4 elements less than the top node abcdef. 

All maximal antichains have 3 elements, e.g., {abcd, e, /}. The longest chains have 5 elements. 
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But we cannot use only 5 elements for labels, as any antichain containing a node with 4 elements 
can have size at most 2. 

5.3 There is no induetive algorithm by the natural ordering 
for the simple set case 

Example 5.3.1 

This example shows that, in general, an induetive procedure is impossible. 

Recall that, for a given set of n elements, the number of subsets of size m < n is („_ T ^| ! „ m! ■ 

Take now a strueture consisting of one antichain with 20 elements, and nothing eise. This can be 
represented with 6 elements and subsets of size 3, as = 20. 

Take a strueture with 4 antichains, each of size 20, and one above the other. Thus, the size of the 
representing sets will increase at least by 1 from the lowest antichain to the next, etc. 

As we are not allowed to look ahead, we begin again with subsets of size 3 of a set of 6 atomic 
labels for the lowest antichain. So the next antichain must consist of sets of at least 4 elements, 
the next of 5, the final of 6. Thus, we need at least 8 elements for representation, as = 28, 7 
elements will not do. 

If, however, we had begun with subsets of size 2 of a 7 element set, by = 21, we could have 
in the top layer only 5 element subsets, and this is again possible, so 7 elements will do. But we 
have to look at the whole strueture to see this. 
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We work with a set of atomic labels L := {a} U B, where a may occur several times, this will be 
written a n for n times a, etc. 

We have the following trivial fact: 

Fact 5.4.1 

(1) Let B' C B, then a n B' and a m B' are comparabel, as n < m or m < n. 

(2) Let B' C B" C B, then a n B' is not comparabel to a m B" iff n > to. 

Corollary 5.4.2 

(1) To code an antichain of size 2™, we need B of size at least n. 

(2) We can code an antichain of size 2" with B of size n. 

Proof 

(1) Suppose -B is smaller, then card(V(B)) < 2", so two elements of the antichain are coded by 
the same B' C B, contradicting Fact EU (page [59]) , (1). 

(2) Let (B 1 ) be the exponent of the (unique by Fact l5.4.11 (page l59"]) . (1)) a (ß ) _B'. Code the elements 
of the antichain by {a^ B ">B' : B' C B}, where B' C B" implies {B") < [B'). Then the codes are 
pairwise incomparable by Fact 15. 4H (page [5Ü ]) . (2). Note that (0) is the biggest exponent, and 
(B) the smallest. (The idea is that, if the ß-part of two codes is comparabel, we make the a-part 
comparabel in the other direction, so the whole codes are incomparabel.) 

□ 

Example 5.4.1 

Consider the structure X -< Y and an isolated Z. 

Obviously, we need at least one b. We may code this by a -< a 2 , b, or, by b -< ab, a 2 , and we have 
two, non-isomorphic, codings. 

In the following, we will code a bottom antichain of size 2™ by {a^ B ~>B' : B' C B}, where B has n 
elements. 

5.5 There is no inductive algorithm by the natural ordering 
for the multiset case 

We now show that an upward inductive algorithm, using the natural ordering, is impossible. For 
this, we discuss progressively more complicatcd examples. The last one, Example 15.5.31 (pagc lÜT]) . 
is perhaps the most interesting, as it shows that we have to consider an arbitrarily deep and wide 
substructure (with non-trivial inferior nodes), to see that a decision taken lower down cannot be 
upheld. 
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Example 5.5.1 

Consider an antichain of 4 elements at the bottom, say A, B, C, D. 
We might code this with the labels abc, a 2 b, a 2 c, a 3 . 

In the next level, we have an antichain of 2 elements, say X, Y, and they have both the same 
predecessors, say A, B. 

Suppose A was coded by a 3 , B by a 2 b. Then we can code X by a 5 b, Y by a 4 bc, and need no new 
label. 

Suppose now that A was coded by abc, B by a 2 b. Then X and Y must include a 2 bc, we may for 
instance make X a 4 bc, but now we have to introduce a new variable, say d, and make Y a 3 bcd. 

So, we have to look ahead. But it can be much more complicated. Take again above example. 
Suppose we have now two antichains, X,Y, and X',Y', one is above A,B, the other above C, D. 
Which one will have the abc! If X, Y is higher than X',Y', then we might have needed already 
d elsewhere, so we can use it without additional cost. But X might also be higher than X', Y' 
higher than Y. What shall we do? 

Example 5.5.2 

This example shows that even the initial step of coding 4 elements with 3 labels, as done above, 
might not always work: again, we have to look ahead. 

Consider again an antichain of 4 elements at the bottom, say A,B,C,D. Again, we might code 
this with the labels abc, a 2 b, a 2 c, a 3 . 

Suppose we have in the second layer one new point above each pair from A, B, C, D. One of the 
bottom nodes will be coded by a n bc, another by o" b, another by a n c. Suppose n! > n". Let X be 
above the bottom elements coded by a n b and a n bc. Then it will also be above the bottom element 
coded by a n c. But this is not wanted. 

Thus, in this Situation, we need a new label, say d, to code the element coded by a n bc. 

(We could also put the second layer nodes X on "stilts", so they will have arbitrary height, like 
a n b -< a n+1 b ~< a n+2 b -<...-< X, etc., so we have to climb up arbitrarily high to see the problem.) 

Fact 5.5.1 

Consider a bottom antichain with elements a^ B ^B' , wherc B' C B. 

Fix now D C B, let D' := B - D, and consider X := {a^'^D'E : E C D}. Then, of course, 
(£>') = (£>'0) > (D'E) for all E ^ 0. 

Let X = X' U X", where X',X" are disjoint and have the same cardinality, and introduce two 
new nodes, B' and B", such that B' y A' for all A' e X', B" >- A" for all A" e X", but for no 
A" e X" B' y A", and for no A' e X' B" y A'. 

Suppose without loss of generality a^ D ^D' e X' . Then there is b G D such that for no a^ D ' E ">D'E e 
X', b e E. (Othcrwise, by maximality of (D 1 ), all x e X would be below A' .) 

On the other hand, for cardinality reasons, there cannot be two such b e D. 

□ 
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Example 5.5.3 

Using Ti + 1 atomic labels, L := {a, bo, ■ ■ ■ , we can code a bottom antichain Xq^q, . . . , Xq^—i 

as follows: Work in the binary System. Set B := {b , . . . , 6„_i}, and code ß' C ß by c(-B') : = 
S{2 4 : bi € £?'}. This gives a natural total Order on V(B), and we use the inverse of this Order for the 
exponent of a. Thus, as it should be, (0) = (0, . . . , 0) is the biggest exponent, and (B) = (1, . . . , 1) 
the smallest exponent. 

In more detail, code Xq^ by a^H, where i is written in binary, i coding as above a subset of B. 
Thus, X ,o is coded by aW0, X ,i by . . . 1 = o^&o, X a , 2 by a(°- 10 )0 ... 10 = a^h, etc., 

up to X ' 2 n-i = a( 6 "- 1 - fc o)6„_i ... b . 

Thon create new nodes above the bottom level, etc., always grouping successive lower nodes to- 
gether, as follows: 

-^1,0, ■ ■ • > X 1 2 ™--L-i 

X\ } i y Xq^2, ^0,i*2+l 

Xk.a, • ■ ■ , X k 2 n-k_ 1 

Xk.i >~ Xk-l,i«2, Xk-l,i*2+l 

up to k = n — 1 (included) . 

The labelling of the new nodes is made by taking the union of lower labels. Our ordering of the 
exponents shows that this is possible, with exactly the relations as defined. See Diagram 15.5.11 
fpage IBTj) for an example with n = 3. 
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For instance, (d) is the highest exponent in the right half, but all exponents on the left half are 
bigger than (d). Thus, all nodes on the right half are below a^bcd, and none on the left is below 
a^bcd. For a^bc, all nodes on the right half contain d, so they are not below a^'bc, etc. We 
add now two additional nodes, a^bd, and a^bcd. The latter will have more nodes below it than 
intended - see the broken line in the diagram. Consider first the node labelled a^bd. The nodes 
below a^^bcd contain c, so they are not concerned, the same holds for those below a^bc. But 
it is impossible to add the node a^bcd : By (c) > (d) > (bd), we see that a^d -< a^bcd and 
a^bd -< a^ c 'bcd, a contradiction. 

This is no accident, it does not depend on the specific choice and distribution of the base labels, as 
we show now. "(labelled . . . )" refers to the examplc for n = 3, described in Diagram 15.5.11 (page 
EU). 

Consider, for an arbitrary labelling, X n _i_o (labelled by cS^bc) and X„_i i (labelled by a^bcd) 
(these are all which are on level n). One of them has to be above without loss of generality, 
let this be X n _io. Note that (0) has to be the strictly biggest exponent, otherwise we have no 
antichain. One of the atomic labels, say bj (d in the diagram) does not occur in the labelling of 
X n -i.a, otherwise, all bottom nodes would be below X n -\ß. For cardinality reasons, all others 
have to occur in the labelling of X„_i o, see Fact l5.5.T1 (page IBP)) . Moreover, bj occurs in all labels 
of the bottom nodes below X n -i^\, and all combinations of the other bk occur below X n -i^. In 
particular, we have a^^bj and a! Kbn ~ 1 "^bn-i . . . bo below X n -i^i, and, by the same reasoning, 
(bj) is the strictly biggest exponent below X n -i^. 

We split now X„_i j0 into X n _ 2 ,o (labelled a^b) and X n - 2 ,i (labelled a^bc) and repeat the 
argument, using again Fact 15.5.11 (page lSÖ]) . 

Suppose, without loss of generality, a^0 is below X„_ 2 ,o, so there must be some a^j'^bj* (labelled 
a^c) below X n ^ 2 .i- As a( b i''bji is not below X n _i t i, (bj>) > (bj). Split now X n -i t i into X„_ 2 ,2 
(labelled a^bd) and X„_ 2 , 3 (labelled a^bcd), and suppose without loss of generality a^bj is 
below X n ^2,2- Create a new node X (labelled a^bcd) above X„_ 2 ,i and JT„_ 2:3 . Then it is bigger 
than a,( b i''bj>, so its label has the exponent (bj>), but it is also above a^ > ™" 1 - b °^fe n _i . . . b (labelled 
a^ bcd ^bcd), so it is also above a^^bj, a contradiction by (bji) > (bj). But we detect this only at 
level n — 2, and we have to look at arbitrarily big subsets of the construction (in width and depth!) 
to find a contradiction. Thus, in a strong sense, a recursion is impossible. 



Note that we may modify above example, e.g., introduce a smallest node with label 0, and then 
lift the whole construction by adding everywhere a new set of labels, so we can embed it into an 
arbitrary diagram. Thus, the problem is not only with the base level. 
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We identify the different situations or objects (cameras, etc.) with propositional models, and the 
properties with propositional variables. The models may be defined only partially. 

To distinguish different models, we name them. Thus, we might have different models with the 
same properties, but with different names. We assume that all values can only be 0/1 (the bull 
example needs more values). 

I think there are different ways to treat the Situation: 



(1) We have only a local ranking, which is based on the values of the propositional variables. 
Based on this ranking, we try to complete the partially defined models. Gaps are permitted 
(undefined values), if there is a gap, we just forget this value for the ranking. If m -< m' ~< m", 
and m'(p) is undehned, then we try to complete it, so that m(p) < m'(p) < m"(p). 

(2) We have, in addition, a global ranking, where model m may be considered better than model 
m', for some external reason. 

In this case, we try to complete the undefined values aecording to local and global ranking. 

(3) We have, in addition, a ranking of the propositional variables, where p might be stronger 
than p', etc. In this case, we can work within one model, e.g., as follows: If m(p) is "positive" , 
and m(p') unknown, then we assume that m(p') is positive, too. 

We then see the following: 

(1) We have a strueture on the language, as 1 is better than 0. In the third case above, we have 
an order on the variables, too, so even more strueture. See p. 10 of our new book. 

(2) We may have a "soft" ranking, where some properties might be unknown, then the known 
properties determine the ranking. 

In this case, we fill in the unknown properties to coincide with the soft ranking. 

(3) I do not see why it is necessary to have only one (?) in the matrix. 

In particular, we may sometimes split 1 big matrix with two (?) into 2 small matrices with 
1 (?) each. 

(4) This way of ordering reminds me of the ordering in deontic logic, where situations may be 
better in several aspects. 

(5) It might be possible to generalize from elementary properties (propos. variables) to formulas. 

(6) The locality of reasoning makes it likcly that we have interpolation - if we find a nice way to 
express it. 

(7) What are the laws of this reasoning? If we modify the matrices, what stays constant, what 
changes, and how? 

(8) If we admit, say, 2 holes, we can examine Cumulativity: Is the result the same, when we fill 
both at the same time, or, first 1, then with the new matrix, 2? 
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(9) / think we can see this as a special case of preferential struetures: Replace the (?) with 
branching into 2 models, then prefer the one which fits in better. 

(10) Vielleicht kann ich auch pref. Modelle wie oben als Matrix sehen, und dann geometrisch 
arbeiten? 

(11) Mit Implikationen machen? 

(12) aus Bahnfahrt: 

• wieso nicht learning/detecting regularity? 

• hat an force bei a gedacht, nicht an min. labels, drum die vielen Fehler 

• Ist das nicht detecting causality? 

• detect order, tendency 

• Ist Ansatz 0/1 einzusetzen, um zu sehen, was besser passt, gerechtfertigt? Koennte das 
nicht eine Tendenz verschleiern? 

Dov, 

/ have a few questions and remarks, which we might discuss on the phonc: 

(1) The problem differs from an interpolation problem, as, in the latter, the order is give, here it 
has to be found. Correct? 

(2) Is finding regularities in one dimension (produet, or model) really the same as finding them in 
the other dimension (properties)? 

(3) I am not sure that the coding of "force" by a n is really what you want, and if the multiset 
approach is the right one. Do you have more on this? 

(4) Detecting regularities is traditionally a learning problem, / think. Is there a reason why this 
is not mentioned? Perhaps, we should work with someone from the learning Community? 

(5) You examine which of the possibilities give a better fit, or 1 in the place of?. Does this 
always correspond to finding regularities? This sounds like a stupid question, but / am not sure 
your answer is always true. If so, it might need a proof. 

Karl 
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Chapter 6 

Re-considering some principles of 
non-monotonic logics 

6.1 Introduction 

We try to take a fresh look at some fundamental ideas of non-monotonic logics. 
In particular, we 

(1) examine the step from "normally ..." to "normal" 

(2) differcntiate the consistency criterion of Reiter defaults 

(3) look at the "inference greed" of Reiter defaults, and other formalisms like inheritance, and 
give it an intuitive semantics through tentative theory formation, and connect it to inductive 
reasoning 

(4) describe that specificity is not always a good criterion 

(5) suggest a more modular approach a la inheritance 

(6) examine subset Systems more general than principal Alters used in preferential structures 

(7) describe how to generalize from propositional to first Order defaults 

(8) introduce a notion of validity of a default in a classical model, and describe how to use it to 
solvc conflicts and determine "good" models 

(9) finally, take a closer look at inheritance and motivate the use of dircct scepticism or of the 
intersection of extensions, and also re-consider the translation of inheritance to other Systems 
by examining their language. 
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We stress those aspects which seem elementary, "first principles" to us, and try to translate pro- 
cedural aspects into a more declarative content. The text is more questions and problems than 
answers. 



6.2 General remarks 

6.2.1 Not all defaults are about normality 

Medical students are told: "if you hear hoofbeat, think horses, not zebras". The meaning is, 
of course, first think of normal, usual situations, and not exotic illnesses. When we walk in 
the country, and hear the hissing of a snake, the advice might be: "think rattlc snakes, not 
garter snakes" , though the latter might be more common. The reason is, to treat first potentially 
dangerous situations. 

Both describe default reasoning, but for different purposes (they can, however, both be summarized 
as "useful" reasoning, the first to treat common situations, the second to avoid dangers). For the 
moment, we treat both as advice for acting (reasoning), or rules, and will write (hoofbeat:horse) 
and (hissing:rattler). They are justified by different rcasons, we have, so far, no formal justification 
or semantics, and no way to treat a System of such rules. But we are aware that the rules are 
"rough" , it might be a zebra, it might be a garter snake, after all. 

Note that the default rule we chose to apply may depend on the context. When we walk in 
the countryside, we use the cautious snake rule, when we observe from a safe position, we may 
use the rule that garter snakes are more common after all, so we conjecture it is a garter snake, 
(hissing:garter-snake) . 

6.2.2 Systems of rules, subideal cases 

We have many rules for birds, (birds:feathers), (birds:fly), (birds:lay-eggs), etc. When we write 
down all rules about birds, it might be that no Single bird satisfies all, the total set of rules for birds 
behaves like the lottery paradox. We may also have a mixture of rules with different motivations. 
In medical diagnosis, one rule might be to check for a common and not so serious illness, another 
rule to exclude a rare, but dangerous and rapidly developping one. We will probably decide about 
the latter first, then turn to the common illness, and if both are wrong, investigate further. Note 
that we do not have here just "normal" and "abnormal" cases, but three classes - just as we 
sometimes have three cases to consider for a mathematical proof. 

6.3 Clarification of notions: Normality and consistency 
6.3.1 Normality 

6.3.1.1 From "normally" to "normal" 

Thcre is an important - but often overlooked, see the author's own work - change from "normally, 
birds fly" to "normal birds fly" . The latter presupposes that normal birds, the ideal bird case, 
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exist, the former does not, it considers also partially normal birds. The ideal case need not exist, 
as the lottery paradox shows. The intersection of the bird sets with "normal" properties might be 
empty - or meaninglessly small. 



6.3.1.2 The behaviour of "normal" vs. finding normal elements 

Preferential struetures and their abstract treatment are about the normal case. They investigate 
the properties of normality, of the ideal case. They do not investigate the subideal case, where 
only some properties of the ideal case are satisfied. This is done, implicitly, by Reiter defaults, 
defeasible inheritance, etc., where we preserve as many normal properties as possible. Preferential 
struetures also do not investigate which Clements (in the first order case) are normal, or as normal 
as possible. This is done by first order Reiter defaults, where as many elements as possible are 
made as normal as possible. 



6.3.2 The consistency criterion for Reiter defaults (and other for- 
malisms) 

A Reiter default is allowed to fire unless the consistency criterion is violated. But the inconsistency 
might be against a classical background theory, or against another default, or a combination of 
other defaults, etc. In particular, criteria like speeificity might be important. Thus, a whole theory 
of elimination of inconsistencies may be necessary to solve confiiets - as it is brought to light in 
defeasible inheritance. In the first order case, which elements are normal, and to which degree, is 
also solved by an, implicitly, complicated theory. 

Note that preferential struetures have total control of minimal elements, so there is no room for 
downward inheriting properties - unless we want to work with special struetures - and potential 
confiiets are obvious. 



6.4 The implicit extension of conjectures 
6.4.1 Inference greed 

Reiter defaults (and, e.g., inheritance networks) are "inference greedy" in the following triple sense: 

(1) The default (: <f>) will "fire", even if we know already tp, (: (f) is implicitly broken down to 
subsets - contrary to preferential struetures, where we do not have this homogeneity. 

(2) In the default set {(: (j>), (: 4>')}, if (: 4>) cannot fire (as -xf> holds), (:</>') may still be able 
to fire - in preferential struetures, we know nothing beyond classical logic about not totally 
normal, ideal, elements, whereas defaults can also treat the subideal case. 

(3) Open defaults (: 4>{x)) make as many elements as possible normal, i.e. satisfy (j){x). 
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6.4.1.1 A justification 



It seems difficult to find a semantics in the usual sense for this behaviour. Why should the 
world "feel" a pressure for normality? Why should there be a direction towards maximal possiblc 
normality in the world? 

The only idea the present author had was to give an (informal) semantics of both the world and 
our theory building about the world. My, certainly naive, idea is in the platonic tradition. We 
make a theory about the world, knowing that it is only an approximation, but try to extend it 
as far as possible (until contradictions - to be elaborated, see above, Section 15.3.21 (page l69])). 
The basic assumption is that the world is regulär, and we can, in principle, describe it in simple 
terms, but our description will not be pcrfcct. It is an assumption about homogeneity of the world, 
and independence of properties, unless proven otherwise. (It is also an exploratory approach: we 
explore the world, and try to be conservative, in the sense of simplicity. As such, it has much in 
common with inductive reasoning.) 

Thus, we have a pragmatic view, make as many defaults hold as possible, also for subsets, and 
for as many elements as possible in the first Order case. We do not seek "best" knowledge, about 
absolutely normal cases, but, more modestly, distinguish between levels of knowledge, probabilities, 
like truth values in inheritance networks. This can then be formalized by a simple relation of 
"better" between models and elements, forgetting the human element of extending knowledge. 



6.4.2 Remarks on specificity 

The specificity criterion for deciding conflicts is one of the basic tenets of non-monotonic reasoning. 
If Tweety is a penguin, we conclude that the more specific Information, that penguins don't fly, 
will win over the more general Information that birds fly. If there is no conflict, we assume that 
subsets behave like supersets - see above. 

The specificity criterion is fine for Classification, as we assume that many properties will be inherited 
from super- to subclass, but not all. Subclasses may have a somewhat modified "building plan". 
But specificity is irrelevant for other properties - for example for "destructive" properties. We 
will not try to find out if dead penguins can still walk, once we understood that dead animals 
cannot walk. Something in the "construction" of the animal has gone wrong, and we do not 
assume normal life functioning any more. Thus, we have to distinguish properties which "feel" 
specificity, and those which do not. (Likewise, we will not investigate how the dead specimens of a 
newly discovered bird behave - we know it already, it is a "transverse" property, and no inductive 
reasoning is necessary.) 

This distinction goes beyond classical logic, as we distinguish different types of predivates (or 
propositional variables, in the propositional case). 

"Penguin" is not a capacity like Aying, but a complex of properties. Similarly, we do diagnosis, 
e.g., for an illness, with distinctive properties, which serve as indicators. 

Note that specificity can be seen as an approximation: a more specific set B is a better approx- 
imation to A than less specific set C : A C B C C But we do not really work with specificity 
as a set-wise relation: Tweety, a kolibri, a blackbird, is a small set, but it seems useless. We 
need "well defined" small sets, like penguins, we need property-wise or class-wise (like penguin) 
approximation. 



6.4.3 Induction 
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The justification for the inference greedy behaviour of defaults makes a connection to inductive 
logic plausible. Inductive reasoning is also inference greedy, we try to push our knowledge as far as 
possible. Of course, the reasoning goes upward, towards the more general case, and not downward 
to subsets. Still, one should explore further if there are common points. In particular: 

(1) Is induction only inverse to the downward extension of knowledge of defaults, or are there 
deeper differences? 

(2) Can we transfer results and rules from one domain to the other? 

(3) Can we dehne inductive reasoning by the generalization which is best extended downward in 
default reasoning (or vice versa)? So one will be a reflection of the other? 

(4) Can we learn from "real" science, how physicists, or researchers in life sciences, determine 
if a theory is thought to be sufhciently corroborated? What does "practical philosophy of 
science" say? How do they exclude "disturbing influences" ? What does this mean for default 
reasoning? Can we reflect this to default reasoning? 

(5) Can the degree of inconsistency of Section 16.81 (page [74l) be generalized to induction? 

6.5 Modularity 

An attractive feature of inheritance Systems is their modularity. Modularity corresponds also to 
the description of Information as approximation. We have several "aims", building blocks of a 
description, and put them together as well as possible, in a principled way, based on a basically 
modular world itself. 

If we take this idea seriously, we have not one big language and theory, but small fragments 
of non-monotonic theories, and - non-monotonic - Operators on those fragments, which combine 
them, similar to a revision of non-monotonic logics by non-monotonic logics. (Combining diffcrcnt 
languages is, e.g., a multiplication of models, etc.) 

6.6 Subset Systems beyond principal filters 

Prcfercntial structures (in the minimal version) generate principal filters on sets, F(X) := {A : 
ß(X) C A C X}, together with coherence properties between filters over different sets, X, X' , etc. 
They have an intuitive Interpretation by the notion of size. The minimal elements are the ideal 
cases, and everything non-minimal is negligeable, or small. 

The lottery paradox and the limit version of preferential structures motivate to consider more 
general filters, or even weak filters. 

Dcfault Systems also generate subset Systems. E.g., {(: </)),(: ip)} generate the "good" subsets 
{m : m \= (/)}, {m : m \= tp}, {m : m \= (j) A ip}, and perhaps {m : m \= <j> V ip}. Considering the 
default System {(: cp V ip), (: ip)} shows that {{m : m |= cp V ip}, {m : m \= ip}} and {{m : m (= tp}} 
should not be considered equivalent. In the latter, only {m : m \= ip} is "good", in the former, 



also {m : m \= (p v "0} wm be considered good, though not as good as {m : m \= ip}. This is 
intuitive, as the dcfault (: tp) might not be able to fire, but the default (: <fi V tp) may - the System 
{(: <p v VOi VOI i s not equivalcnt to the System {(: ip)}. If we interpret tp as the ideal case, 
then both describe the same ideal case, or limit, but not the same subideal cases. (This is like 
contrary-to-duty conditionals.) 

Let Af(X) denote such abstract Systems. 

The following questions arise about Af(X) : 

(1) What are reasonable closure properties of M{X)1 

• A first idea is to proceed as for deontic logic: Take all model sets derived from singlc 
defaults, and close under union and intersection. 

• If {~]N(X) = 0, we should probably consider only non-empty intersections. 

• Should X E Af(X)? Probably not. 

• Is a System like {A,X — A} reasonable? Are Systems with \JAf(X) = X reasonable? 

• Can different closure properties code different intuitions? 

• In which cases does Af(X) describe an approximation of ideal cases? 

(2) Can we compare two different Af(X), Äf'(X), o.g., if VA e N{X)3Ä E Af'{X).A' C A, then 
Af'(X) is at least as sharp as N{X) is? 

(3) What are reasonable coherence conditions between Af(X) and Af(X') 

(4) Can we find an intuitive interpretation of such Systems, as we can interpret ß(X) by size? 

(5) Can we generate such Systems locally by a relation, as we did for ß{X)l By higher order, 
reactive, relations? 

(6) If Af(X) is generated by a probability (as for the lottery paradox), are there special laws, 
resulting from Substitution and sums? 

E.g.: if {x, y} # Af(X), {x, y'}, {x', y} G N{X), then {x' , y'} e M{X)1 

(7) Are there intuitive ways to combinc A e M{X) with A' e Af(X') to A x A' £ N{X x X'), 
etc.? 

Given Af(X), we can compare x, x' e X : 
Definition 6.6.1 

Definc U(x) := {A e Af(X) : x E A}, and S(x) := f| U(x). 

Let x -< x' iff S(x) C S(x') (alternatively: card(S(x)) < card(S(x'))) 

This generalizes the comparison in preferential relations, minimal elements are not comparable 
among each other. 

Remark 6.6.1 

(1) This is a special case of a preferential relation, as minimal elements stay minimal, it is about 
subideal Clements. 
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(2) It is robust under weakenings like in {TV \<j>) , N (<fi V ip)}- 

(3) What are the properties of the resulting relation, coherence conditions? 

(4) Can we find a complctc set of such properties (representation)? 

(5) Transitivity of defaults is treated correctly: for (cf> : ip), (ip : p), the best (/)-models satisfy tp, 
and the best ^-models satisfy p, so the overall best </>-models satisfy p. This is not surprising, as we 
pushed defaults into the order, where we work with the best possible elements, as in preferential 
structures. 

Remark 6.6.2 

A remark on reasoning dynamics: 

The füll System P has no dynamics, because of Cumulativity. In the lottery paradox, once we 
concluded that n will now win, n' has become more likely to win. But, it could also be otherwise. 
If we conclude that a bird will probably fly, the Aying birds might even be more likely to have 
feathers than the not Aying ones. Thus, drawing conclusions might also make further conclusions 
more secure. In inheritance, upward chaining adds new conclusions, but they become less certain, 
as longer paths of reasoning offer more possibilities of attack. 

Thcre does not seem to exist a fully general theory of the dynamics of reasoning - but this might 
also be too general a problem. 

6.7 From propositional to first order logic 

In propositional logic, every (complete) possibility exists exactly once. In Ist order logic, a predicate 
p(.) may have 0, 1, many elements, likewise combinations of predicates, like p(.) A ~~<q(.)> The 
combinations of properties correspond to propositional models. Here, we treat these combinations, 
as if they were classical models. Then, we put as many elements into the "good" combinations, 
and compare all models as in the propositional case. Thus, we try to put as many penguins as 
possible into the non-Aying set, and the others into the Aying set. So, given a Axed universe U, we 
prefer those structures where more elements are "good" . 

(1) More precisely, as in the propositional case, all cases are possible, like b(x) A/(x), b(x)A-if(x), 
etc., but they need not have the same cardinality. E.g., b(x) A might have 3 elements, 
b(x) A f(x) 1 dement, or, vice versa. We prefer the latter, as the "better" case b(x) A f(x) 
has more elements than the "less good" case b(x) A -*f(x). 

Again, this is still up to Interpretation for the right preference relation. This preference 
relation should certainly satisfy: If, in structure S, every x in the universe satishes a default 
set X x which is at least as good as the default set X' x satisAed in structure 5", then S should 
be preferred to S'. More complicated relations may be considered, e.g., taking into account 
cardinalities, like: More x in S satisfy "good" default sets than in S', etc., see Section [B~8l 
(page[74]). 

(2) Suppose we have birds, penguins, sparrows. Penguins cannot be Aying birds, but sparrows 
should be. Sparrows are not penguins, so, Aying sparrows are better than not-Aying sparrows. 
Flying sparrows satisfy both defaults (not being penguins, see Section f6~8l (page l74 |) ). but 
not-Aying sparrows violate the "Ay" default, and satisfy the (penguin : —'fly) default, so they 
are worse. We choose sparrows so that they fall into the normal birds set, or, more precisely, 
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among the most normal birds. Names should be treated as unary predicates, interpreted by 
as normal as possible elements. 

6.8 Validity of defaults and the best models 

Consider the propositional case, and a non-nested dcfault (cp : ip), i.e., <p and ip are classical 
formulas. We treat the default similarly to the classical implication <f> — V ij), and dehne for a 
classical model m : 

m |= ((p : ip) iff m |= —i<j) or m |= <j> A ip. 

We rehne this. In classical logic, validity is absolute, or 1. We differentiate the strength of 
validity for defaults: 

m \= (<p ■ ip) holds with strength 1 (the strength of m \= -xf>) if m \= -xf>. 
m \= (cp : ip) holds with strength M(<p) if m \= <p A ip. 
m \= (cp ■ ip) fails with strength M(cp) if m \= <j> A ->ip. 

The strength M((p) takes care of specihcity - the smaller M(<p), the bigger the strength, this gives 
a partial Order on strength. 

For a füll picture, we have to extend this dehnition to nested defaults. 
Example 6.8.1 

Consider birds, penguin, ravens. Birds (including ravens) hy, penguins don't, penguins are birds, 
etc. A penguin Twcety which does not fly, fails (bird : fly) with strength "bird", and satishes 
(penguin : ->fly) with strength "penguin". A penguin Tweety' which flies, satishes (bird : fly) 
with strength "bird", and fails (penguin : -<fly) with strength "penguin". Tweety is a better 
model of the whole theory than Tweety' is, as Tweety fails for less strong defaults than Tweety' 
does. Blacky, the Aying raven, satishes (birds : fly) with strength "bird", and (penguin : -'fly) 
with strength 1, as it is no penguin. Thus, Blacky is the best model of the theory (among Tweety, 
Tweety', Blacky). 

We turn to the treatment of contradictions, this can be done in several ways, dehning a partial 
relation between models. We outline requirements and possibilities, considering a theory T with 
classical Information (p, . . . and default Information (cp : tp), . . . 

(1) Models which contradict classical Information <p are the worst. 

(2) Models which contradict neither classical nor dcfault Information are the best. 

(3) Fix a classical model m. Let S(m) be the (multi-) set of strengths of defaults which m fails. 
E.g., if m \= (p A cp' , m \= -iip A —*p A -'ip' , and T consists of the defaults (<p : ip), (cp : p), 
(<p' : ip'), then S(m) = {M(<p), M(<p), M(<p')}. (We suppose that ^ tp -H- p - this has to be 
refined to account for ip,p which are not independent.) 

(4) A comparison of m and m' will be via a comparison of S(m) with S(m'). 
There are many possibilities: 
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(4.1) We can treat S(m) as a set, and forget multiple occurrences of the same strength. This is 
probably unsatisfactory, as we will treat a modcl which fails one default the same way as 
a model which fails many defaults - as long as they have the same strength. It results in 
usual preferential struetures, which are unable to treat subideal cases. m \= <p A ip A ->p 
will be then considered equivalent to m' |= <f> A -^tp A ->p (when we consider just the 
defaults (<p : ip), (<p : p)). 

(4.2) We can consider f]S(m), and if (~}S(m) C f]S(m'), conclude that m fails in a worse 
way than m' does. 

(4.3) We can combine (4.2) with a multiset approach, and "count" only if (4.2) will not decide 
between m and m'. 

(4.4) We can use any other reasonable way to Order a set of partially ordered multisets. 
The following questions arise: 

(4.1) These are special preferential rclations, do additional properties hold? 

(4.2) Is there an abstract description, characterization, of such relations? 

(5) The first order case: 

We use above partial order between propositional models to treat (open) FOL defaults. A 
propositional model corresponds to a subset of the universe, like X := {x : p(x) A -*q(x)}. If 
two such subsets X, X' of the universe are comparable by above order, we prefer the model 
which has more elements in the preferred X - all other things being equal. 

This is then a straightforward extension of the propositional case, and handled in the same 
spirit. 

Example 6.8.2 

(1) Consider the default set (p : q), (p : -*q). 

Any -ip-model satisfies both defaults, any p-model one, but not the other. So the globally 
best models are the ->p- models, the best models of T = {p, (p : q),(p : ~^q)} are all p- models. 

(2) This also gives an answer to the inconsistent default (p : -ip) : The globally best models are 
the ^p-models, the best models for p are all p-models, being all equally bad. 

(3) Consider the default set (: p), (: q), (: r), and the background theory (pAqA^r)V(rA^pA^q). 
Then the model pAqA^r satisfies 2 defaults, the model r A ->p A ->q only one. We decide by 
cardinality, so the former model is better. 

(We need here that the defaults are "decomposed" , e.g., not (: p A q) instead of (: p), (: q). 
A finer treatment might be needed to cover cases like (: p A q).) 

(4) We treat the Nixon diamond similarly. [p : r), (q : ->r). Consider T := {pAq, (p : r), {q : -r)}, 
the two models p A q Ar and p A q A ->r are equally good (or bad), so none is preferred - we 
are directly seeptical, we have no result about r. 

(5) We use speeificity. For (b : /), (p : ->f), p — > b, we have the globally best models: -16-models 
and b A -*p A /-models (they satisfy both defaults), among the p-models, (by p — > b, b holds) 
all fail one default, and the ^/-models are better by speeificity. 

(6) Consider (: (j>), (: (f> V if>). The best models are those which satisfy <j>, the second best satisfy 
4> V ip but not <p, the worst satisfy neither. 
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We have treated (4> : ip) above similarly to the classical implication <fi —> ip. It is natural to try 
and extend this, by translating "somehow" (<f> : tp) to tj> — >■ (^normal(<p) V ip), But this is then an 
asymmetric "OR" , as in most cases, normality and ip will hold. In particular, we will prefer to try 
and make normality hold, e.g., in a mechanical proof System. 

We may extend this idea to asymmetric theory revision, where K * (cp V tp) is preferably achieved 
by making <p true. 

6.9 Inheritance 

Remark 6.9.1 

Inheritance diagrams allow to treat subideal cases, but only with Information of differing strength; 
penguins still inherit "feathers" from birds, although they cannot fly. Preclusion might override 
weaker Information. But we have an unrestricted AND for Information of same maximal strength 
(the "Garbage In" rule). So we cannot treat the Lottery Paradox. 

6.9.1 Direct seepticism vs. intersection of extensions 

We have to distinguish whether inheritance Systems are to speak about the State of the world, 
or about our knowledge of the world. We may not know whether Nixon was a paeifist or not 
(direct seepticism), but he was one of the two, so one of the extensions represents reality (leading 
to the intersection of extensions approach). Thus, the distinetion between State of the world 
and knowledge, and between elements and sets, provides an answer to the direct seepticism vs. 
intersection of extensions question. 

In addition, if "Nixon" were a set, and not one element, there is even a third possibility: (almost) 
all Nixons are paeifists, (almost) all Nixons are not paeifists, and there is no majority for either. 
(In knowledge terms, we may know the latter holds, the latter or the first holds, etc., ) 

Remark 6.9.2 

The existence of copies in classical preferential struetures may code our ignorance - we do not know 
which x is smaller than x', we only know that it is one of the x £ X. We have all possibilities 
in one strueture, this expresses seepticism. Alternatively, we may work with many struetures in 
parallel, see [SGMRTOÖ] . this corresponds to an extensions approach. 

6.9.2 The language of inheritance 

(1) The (implicit) language of inheritance is not sets and arrows, but the atoms are arrows, and 
the results are valid paths. Only in a latter step, valid paths are transformed into (soft) 
arrows. When p — > q is an arrow in the diagram, neither q — >• p nor g^p need be in the 
language. Thus, a comparison (soundness and completeness) with the reasoning with small 
sets etc. must only be about the information which can be expressed in the language of the 
diagram. Here, q •/} p may well be a result of reasoning with corresponding small sets, but 
we cannot compare it, as it is not in the language. 
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(2) We see this (the language) also by the fact that we may have several paths resulting in the 
samc conclusion, but one might be destroyed by further reasoning, and the other not. 

(3) We can dehne the language using admissible paths (concatenations of arrows pointing in the 
same directions, with at most one negative arrow, at the end), and/or their conclusions. 
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